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FOREWORD 


The  Target  Acquisition  Symposium  held  at  the  Naval  Training  Center,  Orlando,  Florida, 
on  14,  15,  16  November  1972  was  sponsored  by  the  Engineering  Psychology  Programs,  Office 
Of  Naval  Research,  Washington,  D.  C.  and  was  conducted  by  Martin  Marietta  Aerospace, 

Orlando,  Florida  under  ONR  contract  No.  N00001 4-72-C-0389 .  The  proceedings  are  contained  in 
two  volumes : 

Vol  I.  A  Collection  of  Unclassified  Papers  on  Target  Acquisition 

Vol  XI.  A  Collection  of  Classified  Papers  on  Target  Acquisition  (U) 

The  papers  appear  in  each  volume  in  chronological  order  as  presented.  The  symposium 
was  held  in  six  sessions  moderated  as  follows: 

Session  1  -  Dr.  James  J.  Regan 

2  -  Dr.  Antonio  Mendez 

3  ~  Dr.  Harry  L.  Snyder 

4  -  Dr.  Daniel  B.  Jones 

5  -  Dr.  John  J.  O’Hare 

6  -  Dr.  Martin  A.  Tolcott 

A  brief  biographical  sketch  of  each  Panel  Moderator,  Panel  Participant,  and  Author 
is  included  at  the  end  of  each  volume. 

Special  acknowledgement  is  made  for  the  personal  sponsorship  of  the  Symposium  by  Dr. 
Martin  A.  Tolcott,  ONR,  and  to  James  L.  Carpenter  and  Kathryn  W.  Leonard  of  Martin 
Marietta  for  the  difficult  tasks  of  preparing  for  and  conducting  the  symposium. 


Daniel  B.  Jones,  Editor 
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TARGET  ACQUISITION  SYMPOSIUM 
OPENING  REMARKS 
by 

Fred  A.  Payne 

Vice  President  and  Chief  Engineer 
Martin  Marietta  Aerospace 
Orlando  Division 


PROBLEM 

The  problem  of  accurately  detecting 
and  recognizing  targets  with  either  the 
unaided  eye,  or  even  with  the  use  of 
sophisticated  sensing  devices,  seriously 
limits  the  design  and  operation  of  recon¬ 
naissance,  fire  control,  and  air  launched 
missile  systems.  The  state-of-the-art  in 
missile  design  and  guidance  systems  has 
far  outstripped  the  ability  of  the  human 
operator  to  acquire  ground  targets  at 
maximum  launch  ranges.  The  requirements 
for  low  level  high  speed  attack  profiles 
also  compounds  the  crew’s  difficulty  in 
acquiring  targets  at  safe  distances.  For 
example,  current  optical  correlator  weapons 
systems  are  limited  in  the  acquire-before- 
launch  mode  by  the  pilot’s  ability  to 
detect  targets.  If  the  potential  of 
sensor  and  missile  technology  is  to  be 
exploited,  real  time  target  detection 
and  recognition  will  have  to  be  improved 
by  means  of  more  sophisticated  sensing/ 
display  systems  and  the  associated  man/ 
display  interfaces. 

The  advent  of  "Smart"  weapons  has 
made  the  target  acquisition  problem  very 
real.  Today  we  can  accurately  guide  tac¬ 
tical  weapons  to  small,  hard  targets. 

Many  of  these  weapons  can  be  released  at 
slant  ranges  beyond  50,000  feet.  Guidance 
accuracies  of  less  than  five  feet  can  be 
achieved  with  CEP's.  Direct  hits  are 
being  achieved  on  tanks  or  trucks  from 
ranges  well  beyond  any  previous  capability. 
Even  more  accurate,  longer  range,  and  more 
diverse  weapons  are  in  development.  But 
our  detection  systems  still  do  not  match 
our  weapons  capability. 


For  years  we  have  relied  on  the 
pilot's  or  observer's  unaided  visual 
capability  to  acquire  targets.  But 
the  human  "Mark  I  eyeball"  has  not 
been  improved.  Unassisted  eye  detec¬ 
tion  is  physiologically  limited.  Under 
ideal  conditions  our  reliable  visual 
target  acquisition  performance  will 
be  marginal.  Experimental  evidence 
shows  that,  even  on  a  clear  day,  a 
static  target  twenty  feet  long  with 
high  contrast  will  be  found  by  an 
observer  flying  at  400  knots  speed  at 
ten  thousand  feet  altitude  with  a  high 
probability  of  detection  at  less  than 
20,000  feet  slant  range.  Deduct  air¬ 
craft  trim  up  and  weapon  release  times 
from  that  distance  and  we  are  well  in¬ 
side  the  range  of  accurate  enemy  anti¬ 
aircraft  weapons.  If  the  target  is 
really  "tactical"  -  small,  mobile,  and 
camouflaged  -  then  we  probably  will  miss 
it  entirely. 

Our  current  operational  family  of 
electro-optical  and  electronic  sensors 
do  not  significantly  improve  the  situa¬ 
tion.  While  we  can  detect  some  targets 
with  TV,  FLIR's  or  Radar  at  greater 
than  eyeball  ranges,  these  longer  range 
systems  are  restricted  by  narrow  fields 
of  view  and  limiting  resolution.  None 
of  them  approach  the  Mark  I  eyeball  in 
resolution  or  in  field  of  view.  If  we 
know  where  to  point  the  system,  then 
we  may  have  a  high  probability  of  target 
acquisition  out  to  perhaps  40  thousand 
feet.  This  still  means  that  the  real 
tactical  targets  are  usually  missed 
completely,  since  only  a  small  portion 
of  the  potential  target  area  is  viewed. 
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Our  dilemma  can  be  graphically  illus¬ 
trated.  Typical  air-to-ground  weapon  per¬ 
formance  is  shown  in  Figure  1.  Target 
acquisition,  even  by  our  best  sensor  equip¬ 
ment  and  weapon  release,  means  the  pilot 
will  be  flying  in  the  area  marked  acquisi¬ 
tion  (Figure  1(a)). 

Even  small  anti-aircraft  weapons  fire 
out  to  nearly  20,000  feet  and  medium  AA 
guns  to  around  40,000  feet  (Figure  1(b)). 


If  we  add  enemy  SAM's  to  this  situation 
the  attacking  aircraft  is  subject  to  fire 
long  before  he  can  possibly  acquire  the  tar¬ 
get  (Figure  1(c)).  Our  weapons  capability 
will  allow  us  to  attack  from  relatively 
safe  distances,  but  we  cannot  find  the 
targets . 

The  costs  of  this  problem  are  very 
real.  We  can  successfully  destroy  tactical 
targets  at  relatively  safe  ranges,  but  our 


Figure  1.  Target  Acquisition’s  Problem 


Figure  1(a) 


A 


Figure  1(b) 


Figure  1(c) 


acquisition  technology  does  not  match.  In 
Viet  Nam  we  still  rely  upon  visual  target 
acquisition  for  tactical  targets.  This 
causes  the  attrition  rate  for  the  Forward 
Air  Control  aircraft,  which  are  used  to 
acquire  targets ,  to  be  the  highest  of  any 
aircraft  group  in  the  theater. 

This  gentlemen  is  our  problem.  How 
can  we  find  targets  at  ranges  commensurate 
with  our  weapons' capability? 


In  the  next  three  days  we  will  be  con¬ 
sidering  some  ways  to  solve  that  problem. 
There  are  four  elements  in  any  solution: 

1)  the  sensors;  2)  the  data  processor; 

3)  displays;  4)  the  human. 


Let’s  all  keep  a  mental  box  score 
during  the  symposium  to  determine  where 
the  effort  is  and  is  not  being  invested. 
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OBJECTIVE 


e 


The  objective  of  this  symposium  on 
target  acquisition  is  first,  to  assess 
the  state-of-the-art  in  acquisition 
technology?  second,  to  establish  a  dialog 
among  the  many  groups  represented  here  - 
from  the  theoretician  to  the  crew  member 
who  ultimately  must  use  the  system  -  and 
to  create  an  understanding  of  all  of  the 
interrelated  problems.  We  hope  also 
that  you  may  pose  some  new  approaches 
to  solving  these  problems.  What  we 
need  are  practical  solutions  to  a  long 
standing  problem. 

The  symposium  is  oriented  toward 
the  human  factors  aspect  because  target 
acquisition  eventually  comes  down  to 
the  crewman  trying  to  do  a  job.  How¬ 
ever,  all  of  us  concerned  with  the 
problem  are  represented:  pilots,  engineers, 
designers,  operations  analysts,  and  human 
factors  specialists. 

SCOPE 

The  scope  of  the  three  day  session 
reflects  our  diversity  of  interest: 

-  Operational  Techniques, 

Countermeasures  to  Target 

Acquisition , 

Sensor  Systems , 

-  Operational  and  Systems 

Models,  and 

Image  and  Display  Systems. 

Our  symposium  is  planned  to  con¬ 
sider  each  of  these  areas  from  the 


perspective  of  our  several  divergent 
technical  interests .  We  hope  to  em¬ 
phasize  real  problems  and  practical 
solutions . 


THANKS 

We  at  Martin  Marietta  are  especially 
pleased  that  the  Office  of  Naval  Research 
has  shown  a  continuing  interest  in  these 
practical  solutions.  We  believe  that  the 
applied  research  in  this  area,  that  has 
been  sponsored  by  ONR,  will  result  in 
better  and  more  effective  target  acquisi¬ 
tion  systems. 

« 

We  particularly  wish  to  thank: 

-  Dr.  Martin  Tollcott,  Director 
of  ONR's  Engineering  Psychology 
Programs,  for  his  direct  interest 
that  made  this  meeting  possible. 

Captain  J.  F.  Gillooly,  Commander 
of  the  Orlando  Naval  Training  Center, 
who  has  graciously  provided  us  with 
the  facilities  and  support  necessary 
to  conduct  our  meeting. 

Captain  A.  G.  Finley,  Commanding 
Officer  of  the  Naval  Training  Equip¬ 
ment  Center  who  has  lent  the  services 
and  time  of  his  personnel  to  assist 
us . 

These  gentlemen,  and  myself,  wish  to  wel¬ 
come  you  here  today.  Find  the  Target  and 
keep  your  attention  there  for  these  next 
three  days. 

I  will  now  turn  you  over  to  our  symposium 
Chairman,  Dr.  Daniel  B.  Jones. 
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THE  EMETF  WEAPON  SYSTEM  ELECTROMAGNETIC  ENVIRONMENT  SIMULATOR 


R.  L.  McCluskey 

U.  S.  Army  Electronic  Proving  Ground 


ABSTRACT 

This  paper,  presented  as  a  slide  film 
briefing,  delineates  the  capabilities  and 
operational  modes  of  a  Weapon  System  Elec¬ 
tromagnetic  Environment  Simulator.  This 
simulator  is  part  of  the  measurement  facil¬ 
ities  of  the  Electromagnetic  Environmental 
Test  Facility  (EMETF)  of  the  U.S.  Army 
Electronic  Proving  Ground  at  Fort  Huachuca, 
Arizona.  In  this  discussion  the  capabilities 
and  operation  of  a  mathematical  interference 
prediction  model,  as  well  as  other  test 
facilities  of  the  EMETF,  are  related  to  the 
Weapon  System  Electromagnetic  Environment 
Simulator.  This  relation  of  prediction  and 
simulation  software  to  the  empirical  testing 
of  component  and  data  links  in  testing  is 
explained.  The  development  history  of  the 
EMETF  and  plans  for  a  future  interactive 
and  integrated  test  capability  conclude  the 
briefing. 

The  purpose  of.  this  unclassified  brief¬ 
ing  is  to  acquaint  you  with  the  Weapon  Sys¬ 
tem  Electromagnetic  Environment  Simulator, 
the  WSEES ,  In  addition,  we  will  discuss 
the  capability  of  the  EMETF  to  evaluate  the 
electromagnetic  compatibility  of  a  total 
weapon  system. 

The  WSEES  is  capable  of  simulating,  under 
controllable  and  repeatable  conditions,  the 
exact  radio  frequency  signals  associated 
with  a  radar  or  weapon  system.  Simultan¬ 
eously  it  can  simulate  signals  associated 
with  other  weapon  systems,  ECM  devices,  and 
communications-electronics  systems  that  might 
degrade  the  performance  of  the  system  being 
evaluated . 

This  simulator  is  part  of  the  measurement 
facilities  of  the  Electromagnetic  Environ¬ 
ment  Test  Facility  of  the  U.S.  Army  Elec¬ 
tronic  Proving  Ground  at  Fort  Huachuca,  Ari¬ 
zona,  a  testing  activity  of  U.  S.  Army  Test 
and  Evaluation  Command.  The  mission  of  the 


EMETF  is  to  determine  the  electromagnetic 
compatibility  -  EMC  -  of  Army  communications- 
electronics  equipment,  systems,  and  concepts. 
To  place  the  WSEES  in  the  context  of  the 
EMETF* s  mission,  I  will  give  a  brief  sum¬ 
mary  of  the  EMETF* s  history  and  the  require¬ 
ments  for  the  WSEES. 

* ‘Electromagnetic  compatibility**  is 
defined  in  AR  11-13.  The  key  words  are 
* ‘intended  operational  environments**  and 
‘ ‘unwanted  electromagnetic  radiation  or 
response**.  This  includes  both  intentional 
and  unintentional  interference. 

Electromagnetic  compatibility  became  an 
issue  because  of  the  Army’s  increasing  use 
of  weapons,  radios,  and  other  devices  using 
radio  frequencies  in  a  limited  portion  of  the 
electromagnetic  spectrum.  A  study  was  con¬ 
ducted  in  the  1950* s  to  identify  any  prob¬ 
lems  that  might  result  from  this  trend  and  to 
devise  solutions  to  such  problems.  The  study 
was  approved  by  the  Secretary  of  the  Army  in 
1958.  It  resulted  in  the  establishment,  in 
1960,  of  the  Electromagnetic  Environmental 
Test  Facility  -  or  EMETF. 

Initially,  EMETF  testing  was  con¬ 
ducted  in  a  field  facility.  This  field 
facility  was  to  contain  a  U.S.  corps  com¬ 
posed  of  three  on-line  infantry  divisions 
and  a  reserve  armored  division  in  a  defensive 
posture  along  the  Gila  River.  Deploying 
and  evaluating  the  required  communications- 
electronics  systems  in  this  facility  soon 
proved  to  be  both  costly  and  time  consuming, 
and  was  plagued  by  technical  measurement 
and  control  problems.  These  difficulties 
led  to  the  present  concept  of  operation  of 
the  EMETF. 

To  make  the  many  measurements  required 
by  our  diverse  analyses,  the  EMETF  uses  six 
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interrelated  facilities:  the  Field  Facility, 
where  we  measure  equipments  under  field 
conditions;  the  Instrumented  Workshop, 
where  we  perform  laboratory- type  measure¬ 
ments  and  closed  link  tests;  the  Spectrum 
Signature  Facility,  where  we  analyze  spec¬ 
trum  characteristics  of  emitters;  the 
Scoring  Facility,  where  we  measure  informa¬ 
tion  transfer  of  communications-electronics 
(CE)  systems;  and  the  WSEES,  where  we  de¬ 
termine  the  effects  of  interference  on 
weapon  systems  and  radars  (Figure  1). 

Finally,  in  the  computer-automated  Interfer¬ 
ence  Prediction  Model,  we  use  data  from  these 
five  measurement  facilities  to  perform 
large-scale  analyses  of  CE  tactical  deploy¬ 
ments  and  operational  concepts. 


Figure  1 .  EMETF  Operational  Facilities 


The  Interference  Prediction  Model,  or 
IPM,  is  a  major  analysis  tool  of  the  EMETF 
and  requires  basic  types  of  input  data.  The 
test  bed  provides  information  on  equipment 
types  and  locations,  equipment  linking  and 
netting,  and  other  pertinent  data  concerning 
the  deployment  and  operational  concept  being 
evaluated.  The  characteristics  of  the 
equipments  being  evaluated  are  provided  by 
the  spectrum  signature  data.  The  propaga¬ 
tion  model  provides  propagation  path  loss 
data.  Scoring  data  are  performance  infor¬ 
mation  on  a  particular  equipment  that  enables 
us  to  rate  its  effectiveness  with  and  with¬ 
out  interference  or  jamming.  We  determine 
scoring  data  by  empirical  measurements  under 
controlled  conditions.  The  IPM  then  deter¬ 
mines  the  equipment  performance  in  the 
tactical  environment  both  with  and  without 
interference . 

Communications  equipment  is  subject  to 
interference  from  any  signal  on  the  same 


frequency.  The  desired  signal  when  combined 
with  a  sufficiently  powerful  interfering 
signal  is  likely  to  become  entirely  unin¬ 
telligible.  Measuring  these  relative  signal 
to  interference  levels  in  our  Instrumented 
Workshop  yields  the  scoring  data  needed 
(Figure  2) . 
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Figure  2.  Communications  Interference 


A  radar  receiver,  however,  may  not 
‘‘notice*’  a  signal  other  than  its  desired 
signal,  even  when  the  two  are  on  the  same 
frequency,  if  their  timing  and  phasing 
differ.  The  upper  part  of  Figure  3  illus¬ 
trates  this  type  of  situation.  The  lower 
part  shows  interfering  pulses  in  time  phase 
with  some  of  the  desired  signal  pulses. 

When  the  signals  are  in  time  phase,  inter¬ 
ference  may  occur.  When  they  are  not  in 
time  phase,  interference  may  not  be  notice¬ 
able  even  if  the  undesired  signal  is  much 
stronger  than  the  desired  signal.  Obtain¬ 
ing  data  for  scoring  the  effectiveness  of 
radars  and  weapon  systems  thus  require  more 
complex  equipment  and  procedures  than  for 
communications  equipment. 
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Prior  to  development  of  the  Weapon 
System  Electromagnetic  Environment  Simula¬ 
tor,  the  WSEES,  scoring  data  on  radars  and 
weapon  systems  had  to  be  obtained  by  field 
testing.  Not  only  was  this  time  consuming 
and  laborious,  but  data  were  frequently 
difficult  to  verify  because  of  the  non- 
repeatable  nature  of  field  conditions,  and 
costs  were  often  exorbitant. 

We  therefore  developed  a  facility  simi¬ 
lar  in  concept  to  the  Instrumented  Workshop, 
and  capable  of  controlled  and  repeatable 
testing  of  radars  and  weapon  systems.  This 
simulator,  the  WSEES,  enables  us  to  analyze 
the  EMC  of  such  equipment  without  elaborate 
field  tests. 

The  simulator  had  to  have  the  capability 
of  simulating  the  RF  signals  associated  with 
air  defense,  shell  and  mortar  tracking, 
and  combat  surveillance  radars.  These 
signals  include  pulse,  CW  and  both  pulse 
and  CW  doppler. 

Figure  4  shows  the  overall  WSEES  includ¬ 
ing  both  hardware  and  software.  Since  the 
EMETF  uses  the  Interference  Prediction  Model 
or  IPM  to  determine  equipment  performance  in 
the  tactical  situation,  the  primary  pur¬ 
pose  of  the  WSEES  is  to  provide  data  on  the 
effects  of  interference  on  radars  and 
weapon  systems.  Therefore,  we  can  use  the 
general  purpose  computer  at  Fort  Huachuca  to 
prepare  our  control  tapes.  These  are  then 
used  to  control  the  RF  signals  generated 
by  the  basic  simulator. 


Figure  4.  Overall  WSEES  System 


Figure  5.  Present  WSEES  Equipment 


The  digitally  controlled  RF  generator 
section  contains  three  RF  generators,  each 
operating  in  three  bands  from  2  to  12.4 
GHz  (Figure  6) .  Two  of  these  generators 
may  be  phaselocked  to  the  equipments  under 
test  to  provide  coherent  signals.  Each 
generator  can  be  time-shared  to  simulate 
several  different  emitters.  This  permits 
simulating  up  to  32  pulsed  emitters.  The 
doppler  shift  range  of  30  to  100  kHz 
allows  simulation  of  a  range  of  targets  from 
a  walking  man  to  a  mach  9  missile. 


•  FREQUENCY  RANGE  {GHz) 

■  OVERALL  2  0  TO  12  4 

RANGE  1.  2  0  TO  4  0 
RANGE  2.  40  TO  8  0 
RANGE  3.  8.0  TO  12.4 

•  FREQUENCY  GENERATORS  3 

■  2  RF  GENERATORS  ARE  COHERENT 

{PHASE  LOCKED) 

•  NUMBER  OF  SIGNALS-32 

■  12  SIGNALS  INTERNALLY  CONTROLLED 

■  20  SIGNALS  EXTERNALLY  CONTROLLED 


■I 


Figure  6.  WSEES  Characteristics 


The  WSEES  has  a  highly  versatile  pulse 
generation  capability.  Pulses  can  have 
widths  from  0,1  to  200  microseconds  at 
repetition  rates  from  20  to  50,000  pulses 
per  second.  The  individual  pulses  can  be 
controlled  in  rise  time,  fall  time,  and 
pulse  width  to  simulate  various  signals 
in  the  environment. 


The  present  WSEES  equipment  is  shown  in 
Figure  5.  On  the  right  are  the  tape  input 
units;  in  the  center,  the  manual  control 
unit  and  digital  interface.  The  two  racks 
on  the  left  contain  the  RF  generation  equip¬ 
ments  . 


Additional  characteristics  are  shown 
in  Figure  7.  The  WSEES  is  capable  of 
producing  chirped  pulses  and  pulse  burst 
patterns  as  indicated.  Because  the  present 
WSEES  contains  only  three  RF  generators,  it 
is  limited  to  producing  three  continuous 
wave  or  CW  signals. 
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•  CHIRP 


•  PULSE  BURST 

•  CW  EMITTER 


•  RANGE  DELAYS 


MAX  RATE 
2  5M  Hz/us 


3 

PULSE 

PATTERN 

GENERATOR 


3. 

MAXIMUM 


immm 

INTERNALLY 

CONTROLLED 

SIGNALS 


Figure  7.  WSEES  Special  Features 

A  functional  diagram  of  RF  generation 
control  is  shown  in  Figure  8.  The  coarse 
frequency  control  is  applied  to  the  cathode 
of  the  backward  wave  oscillator  (BWO)  and 
the  fine  frequency  control  to  the  helix. 

The  fine  frequency  control  also  controls 
the  chirp  rate  of  a  pulse,  if  required. 

The  switch  network  between  the  BWO  and  the 
traveling  wave  tube  (TWT)  provides  the  pulse 
width  control.  The  TWT  amplifies  the  sig¬ 
nal  and  also  provides  the  pulse  shape  con¬ 
trol.  Finally,  the  attenuators  set  the 
specified  power  level. 


Figure  8.  RF  Generation  Control 

Figure  9  is  a  block  diagram  of  the  Phase 
I  WSEES.  The  RF  generation  control  just 
discussed  is  shown  in  the  two  blocks 
labeled  ‘‘RF  Generators’*  and  ‘‘Power  Con¬ 
trols.’  *  The  control  information  is  pro¬ 
vided  to  the  digital  interface  by  the 
manual  input  device  or  the  tape  units.  The 
digital  interface  stores  information  on  as 
many  as  32  emitters,  and  passes  these  data 
to  the  RF  generators  when  needed.  The  digi¬ 
tal  control  is  shown  by  the  thin  lines  and 
the  RF  signal  by  the  heavy  line. 

Figure  10  depicts  a  typical  situation 
that  could  be  simulated  by  the  WSEES.  This 
problem  also  demonstrates  the  versatility  of 
the  system.  The  test  radar  system,  in  the 
foreground  is  tracking  the  target  aircraft 


and  controlling  the  missile,  which  is  in 
flight  to  intercept  the  target.  The  target 
has  fired  a  decoy  forward  to  confuse  the 
radar  and  missile  and  is  also  using  ECM 
against  both  the  radar  and  missile.  A 
background  radar  is  also  tracking  the  target 
but  has  not  launched  an  intercept  missile. 


Figure  9.  Phase  I  WSEES 


The  signals  to  be  generated  include 
those  reflected  to  the  radar  from  the  tar¬ 
get  and  the  control  signal  from  the  radar 
to  the  missile. 

Possible  interfering  signals  from  the 
background  radar  are  those  reflected  from 
both  target  and  missile  to  the  test  radar 
and  also  from  target  to  missile.  There 
may  also  be  a  direct  signal  path  from  the 
second  radar  to  the  first. 

We  must  also  consider  the  use  of  an 
electronic  countermeasures  system,  or  sys¬ 
tems,  against  both  the  radar  and  the  missile 

Combining  all  these  presents  a  complex 
electromagnetic  environment  that  can  be 
expected  in  a  typical  situation.  The  WSEES 
can  simulate  all  of  these  signals.  In  addi¬ 
tion,  it  can  simulate  other  signals  in  the 
environment  that  could  cause  interference. 
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Future  weapon  systems  may  have  a  multiple 
reaction  capability.  They  will  be  able  to 
adapt  to  interference  or  jamming  by  changing 
frequencies  and  control  modes  and  other  ECCM 
techniques.  In  the  example  shown  in  Figure 
11,  if  the  missile  were  using  semi-active 
homing,  it  could  lock  on  and  track  the 
decoy,  and  would  therefore  miss  the  target. 
However,  the  ground  radar  by  tracking  the 
target,  the  missile,  and  the  decoy  could 
ascertain  that  the  missile  was  veering 
away  from  the  target.  The  ground  unit  could 
then  provide  guidance  commands  to  bring  the 
missile  back  on  target. 


To  permit  rapid  testing  of  these  advanced 
systems,  a  Phase  II  development  effort  is 
underway  to  add  a  real-time  control,  a 
steerable  platform  and  other  features  in 
Figure  12.  I  will  discuss  the  application 
of  each  of  these  improvements. 


Figure  12.  Phase  II  Development  Effort 


The  real  time  controller  is  the  heart 
of  the  system  (Figure  13).  It  operates  in  a 
feed-back  loop  with  the  system  under  test. 

In  its  normal  mode  of  operation,  the  WSEES 
would  generate  a  signal  environment  as 
commanded  by  the  precalculated  tapes  for  the 


operational  situation  being  simulated.  When 
the  operational  mode  for  the  system  under 
test  changes,  the  real-time  controller  would 
sense  this  change  and  alter,  as  appropriate, 
the  signals  being  generated.  To  accomplish 
the  signal  changes  or  modifications  required, 
the  real-time  controller  must  not  only  sense 
that  a  change  in  the  operational  situation 
has  occurred,  but  must  adjust  the  signals 
to  reflect  the  change. 


Figure  13.  The  Real  Time  Controller 


The  steerable  platform  permits  simu¬ 
lation  and  control  of  the  spatial  position 
of  the  missile.  By  monitoring  the  signals 
to  the  missile  control  surfaces  in  respect 
to  the  calculated  space  position  and  speed 
of  the  missile,  the  real-time  controller 
can  change  simulation  parameters  to  reflect 
changes  in  missile  position. 

Four  additional  items  are  shown  in 
Figure  14  —  a  remote  output  signal  control, 
a  RAM  enclosure,  an  event  recorder,  and  a 
rapid  manual  input  and  display. 
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The  remote  output  signal  control  will 
provide  four  independently  controlled  out¬ 
puts  for  each  of  the  three  RF  channels. 

This  will  permit  direct  cabling  to  the. 
four  inputs  of  a  monopulse  system,  and  also 
increase  the  number  of  signal  output  ports 
from  three  to  twelve. 

The  radiation  absorbent  material  enclo¬ 
sure,  the  RAM,  will  house  radiating  antennas 
which  are  required  for  tests  when  we  can’t 
direct  wire  into  the  receiver. 

The  event  recorder  provides  a  record 
of  all  test  conditions  and  any  changes 
made. 

The  rapid  manual  input  and  display  will 
allow  the  test  situation  to  be  entered 
manually  and  changes  made  by  the  test  engi¬ 
neer  without  computer  generated  test  tapes. 

Figure  15  is  an  artists’s  conception  of 
the  control  panel  for  the  manual  input.  The 
initial  parameter  data  for  each  emitter 
can  be  entered  by  tape,  punched  card,  or 
manually.  The  operator  can  check  all  data 
by  manually  selecting  each  emitter  and  ob¬ 
serving  the  parameter  data  for  that  emitter. 
The  emitter  number  is  displayed  in  the 
center  window  and  the  parameter  data  in 
the  windows  on  the  left.  By  depressing  the 
appropriate  plus  or  minus  button  associated 
with  each  parameter  display,  the  operator 
can  manually  change  that  parameter  as 
desired. 


Figure  15.  Rapid  Manual  Input 
Controller  Panel 


The  Weapon  System  Electromagnetic  En¬ 
vironment  Simulator  was  designed  primarily 
to  evaluate  the  EMC  of  that  portion  of  a 


weapon  system  utilizing  radar  type  sig¬ 
nals.  However,  a  total  weapon  system  has 
many  other  elements. 

To  evaluate  the  voice  and  data  com¬ 
munications  of  this  overall  system,  the 
EMETF  is  currently  installing  a  new  Auto¬ 
matic  Data  Collection  System  which  can  auto¬ 
matically  test  every  element  of  the  voice 
and  data  transfer  system  associated  with  an 
air  defense  system. 

The  heart  of  the  automatic  data  collec¬ 
tion  system  is  the  digitally  controlled 
automatic  measuring  equipment.  With  either 
manual  or  taped  control,  these  equipments 
can  automatically  measure  and  record  all 
of  the  parameter  and  test  data  associated 
with  a  communications  test  link,  to  include 
the  effects  of  interference  or  jamming. 

Many  future  systems  may  also  include 
so  called  electro-optical  elements  and 
laser  communications  as  elements  of  an  air 
defense  system. 

To  evaluate  these  systems  the  EMETF  is 
in  the  final  design  phase  of  an  electro- 
optical  test  facility.  This  facility  will 
analyze  the  performance  of  these  and  other 
sensors  in  the  EO  spectral  region. 

In  summary,  the  EMETF  found  that  field 
measurements  of  military  communications- 
electronics  systems  were  prohibitively  slow, 
variable,  and  expensive  (Figure  16).  We 
developed  the  WSEES  to  simulate  radar  and 
weapon  system  emissions.  Measurements  made 
from  WSEES  simulations  are  fast,  repeatable, 
controllable,  and  inexpensive.  We  can 
measure  the  effects  of  these  emissions  on 
communications-electronics  equipments  and 
gain  reliable  scoring  data  for  EMC  analyses. 


Figure  16.  Field  Testing  Versus  Simulation 


14 


AIR-TO-GROUND  AND  GROUND-TO-AIR 
DETECTION  EXPERIMENTS 

by 

Dr.  Marion  R.  Bryson 
Scientific  Advisor,  U.S.  Army  Combat 
Developments  Experimentation  Command 
Fort  Ord,  California  93941 


OFFICE  OF  NAVAL  RESEARCH 

TARGET  ACQUISITION  SYMPOSIUM 

NAVAL  TRAINING  CENTER,  ORLANDO,  FLORIDA  /  14,15,16  NOVEMBER  1972 


15 


16 


AIR-TO-GROUND  and  GROUND-TO-AIR  DETECTION  EXPERIMENTS 
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Developments  Experimentation  Command 
Ft  Ord  ,  California  93941 


ABSTRACT 

Recently,  a  series  of  experiments  was 
conducted  at  Hunter  Liggett  Military  Reser¬ 
vation,  California  and  at  Fort  Lewis,  Washing¬ 
ton.  The  purpose  of  these  experiments  was 
to  gain  information  on  the  length  of  time  re¬ 
quired  for  a  hovering  helicopter  to  detect  a 
ground  target  and  also  to  determine  the  time 
required  for  the  ground  element  to  detect 
the  helicopter.  This  paper  discusses  the 
detection  times  observed  in  these  experiments 
and  compares  them  with  some  theoretical 
detection  curves. 

INTRODUCTION 

The  US  Army  Combat  Developments  Experi¬ 
mentation  Command  at  Ft  Ord,  California 
has  the  mission  of  performing  experiments 
for  the  army  in  which  equipment  is  used  by 
the  troops  in  the  field.  A  series  of  experi¬ 
ments  to  assess  the  capabilities  of  the  attack 
helicopter  is  currently  being  conducted.  In 
204  experimental  trials  in  this  series ,  time 
from  initiation  of  line  of  sight  to  detection 
was  measured.  These  detections  included 
both  detection  of  a  ground  element  from  a 
helicopter  and  detection  of  the  helicopter  by 
the  ground  element.  All  of  those  trials  in 
which  time  to  detect  was  accurately  measured 
are  included  in  the  analyses  given  in  this  paper. 

Conditions  varied  considerably  from  trial 
to  trial.  Those  factors  which  were  controlled 
and  for  which  there  were  an  approximately 
equal  number  of  trials  for  each  level  are: 


a .  Background .  The  background  of 
the  ground  element  was  either  cluttered 
(many  trees  and  bushes  although  the 
vehicles  were  not  hidden  from  view)  or 
open. 

b.  Movement.  The  ground  element 
was  either  moving  or  stationary. 

c.  Range.  The  range  from  the  hover¬ 
ing  helicopter  to  the  ground  element  was 
either  1,  2  or  3Jdlometers . 

d.  Information.  The  accuracy  of 
the  information  the  air  crew  possessed 
concerning  the  location  of  the  ground 
element  was  either  six  digit  coordinates 
of  its  location  (exact)  or  a  one  square 
kilometer  grid  square  within  which  the 
target  was  located  (1  km^) . 

e.  Crews.  There  were  three  differ¬ 
ent  air  crews  used  in  the  trials . 

There  were  three  other  controlled 
factors  for  which  all  levels  were  not 
equally  represented .  They  are: 

(1)  On  board  sighting  system/aircraft 
type.  In  about  two-thirds  of  the  trials, 
the  aircraft  was  a  UH-1B  with  an  XM-26 
fire  control  system  aboard.  This  is  the 
TOW  sighting  system.  The  remaining 
one  third  were  equally  divided  between 
two  AH-1G  helicopters ,  one  with  the 
XM-76  stabilized  binoculars  and  the 
other  with  the  XM-127  multi-weapons 
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fire  control  system. 

(2)  Aircraft  background .  The  aircraft, 
while  hovering,  was  silhouetted  against  the 
sky  less  frequently  then  it  had  a  terrain 
background . 

(3)  Geographical  location.  About  two 
thirds  of  the  trials  were  at  Hunter  Liggett 
Military  Reservation  in  Central  California 
and  the  remainder  were  at  Fort  Lewis  , 
Washington. 

DATA  COLLECTION  METHODS 

A  candidate  tactic  for  employment  of  the 
attack  helicopter  is  called  the  pop-up 
maneuver.  In  this  maneuver,  the  helicopter 
crew  is  given  (with  variable  accuracy)  the 
location  of  an  enemy  ground  element ,  generally 
an  armored  target,  by  a  forward  observer. 

At  the  time  the  air  crew  receives  this  infor¬ 
mation  they  are  3  to  10  kilometers  behind 
the  lines.  The  helicopter  will  fly,  nap-of-the- 
earth ,  toward  the  enemy  until  they  are  within 
the  range  of  the  anti-armor  defensive  weapon 
carried  by  the  helicopter.  Because  of  its  low 
altitude  and  the  crews  judicious  use  of  cover, 
it  is  presumed  that  the  helicopter  has  remain¬ 
ed  hidden  from  the  ground  targets  during  this 
approach.  Once  in  range,  the  helicopter  will 
come  to  a  hover,  behind  a  terrain,  vegetation, 
or  cultural  feature  which  masks  it  from  the 
enemy.  The  helicopter  will  then  rise  verti¬ 
cally  to  gain  line  of  sight  with  the  target. 

It  will  again  come  to  a  hover  just  above  mask 
while  the  crew  searches  for  the  enemy 
armored  target.  Upon  detection,  the  air¬ 
craft  gunner  will  fire  an  anti-armor  weapon 
and,  if  it  is  a  launcher  guided  missile,  will 
track  the  missile  to  the  target.  The  heli¬ 
copter  will  then  regain  mask  as  soon  as 
possible . 

Although  in  the  trials  several  time 
measurements  were  made,  we  are  interested 
here  only  in  detection  times  and  total  air¬ 
craft  exposure  times.  To  get  an  accurate 
measure  of  the  detection  time,  the  trials 
were  carefully  controlled.  The  ground 


element ,  which  consisted  of  five  armored 
vehicles ,  tanks  and  armored  personnel 
carriers ,  was  prepositioned  on  the  ground . 
One  helicopter  was  brought  in  at  nap-of- 
the -earth  to  a  predesignated  launch 
position,  remaining  masked  from  the 
ground  element.  On  top  of  the  masking 
position,  always  a  hill,  was  located  a 
controller  with  whom  the  aircraft  and 
another  controller  with  the  ground 
elements  had  radio  communication. 

The  levels  of  the  eight  factors  discussed 
above  are  recorded.  The  stage  is  now 
set  for  a  trial  to  begin. 

The  hilltop  controller  radios  the  air¬ 
craft  to  unmask.  At  the  time  line  of 
sight  first  exists  between  the  helicopter 
and  the  ground  elements ,  as  determined 
by  the  hilltop  controller,  he  starts  two 
stopwatches.  The  ground  controller 
with  the  armor  force  also  starts  a 
watch  at  the  time  line  of  sight  first 
exists.  When  a  crew  member  of  the  heli¬ 
copter  detects  a  ground  target,  he 
informs  the  hilltop  controller  who  then 
stops  one  of  his  watches.  Similarly, 
when  a  crew  member  of  the  ground 
element  detects  the  helicopter,  he 
tells  the  ground  controller.  When  the 
helicopter  regains  mask,  the  hilltop 
controller  stops  his  other  watch,  there¬ 
by  measuring  total  exposure  time. 
Safeguards  are  employed  to  identify 
false  detects.  For  the  purpose  of  this 
paper  there  are  11  items  of  information 
recorded  per  trial: 

a.  Range. 

b.  Aircraft  crew . 

c.  Target  acquisition  device. 

d.  Ground  target  background . 

e.  Ground  target  movement. 

f .  Ground  target  location  information. 
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g .  Aircraft  background . 

h.  Geographic  location. 

i.  Time  for  aircraft  crew  to  detect. 

j.  Time  for  ground  to  detect  the  aircraft 
(if  a  detection  occurred). 

k.  Total  exposure  time. 

FACTORS  INFLUENCING 
DETECTION  TIMES 

When  the  data  from  the  204  trials  were 
analyzed,  those  factors  found  to  have  statis¬ 
tically  significant  effect  upon  air-to-ground 
detection  time  were  crew ,  ground  target  back¬ 
ground  and  geographic  location.  Although  the 
significance  of  these  factors  is  not  surprising, 
one  would  have  expected  that  range  and  target 
movement  would  have  had  an  effect  as  well. 
Such  is  not  the  case.  The  range-information 
interaction  was  significant.  Six  digit  coordi¬ 
nate  information  helped  locate  the  target  at 
the  long  ranges  but  was  of  no  value  at  short 
ranges . 

With  regard  to  ground-to-air  detection 
times  (and  detection  probabilities)  range 
and  geographic  location  were  statistically 
significant  factors.  There  are  two  surpris¬ 
ing  results  here: 

a.  That  whether  the  aircraft  had  a  sky 
or  a  terrain  background  made  little  differ¬ 
ence  as  regards  its  detectability  and 

b.  That  whether  the  trial  was  at  Hunter 
Liggett  or  at  Fort  Lewis  did  have  a  large 
effect . 

A  reasonable  explanation  for  both  of 
these  results  is  based  on  the  principle  that 
shape  contrast  rather  than  color  or  reflec¬ 
tive  contrast  may  be  a  more  important 
factor  in  detection  of  targets  of  this  nature. 
The  terrain  mask  above  which  the  aircraft 
is  hovering  during  the  trial  is  more  varied 
at  Ft  Lewis  ,  primarily  because  of  heavier 
vegetation. 


ANALYSIS  OF  AIR-TO-GROUND 
DETECTIONS 

A  common  theoretical  approach  used 
in  search  theory  assumes  that  the  prob¬ 
ability  the  target  is  detected  during 
any  given  unit  of  time,  under  the  condi¬ 
tions  that  it  has  not  been  detected  up 
to  that  time ,  is  independent  of  search 
time.  This  leads  to  the  well  known 
equation 

pd  (t)  =  i-(i-p)t  (i) 

where 

P^(t)  =  Probability  a  detection  has 
occurred  in  t  units  of  time. 

t  =  Number  of  units  of  time  search 
has  progressed. 

P  =  Probability  a  detection  occurs 
in  any  single  unit  of  time. 

The  value  of  the  parameter  of  P  can  be 
estimated  from  empirical  data  if  one 
assumes  that  the  empirical  observations 
all  come  from  the  same  population.  In 
the  analysis  which  follows  ,  all  204 
observations  were  combined  as  if  they 
had  a  common  parametric  value .  This 
is  a  reasonable  approach  if  the  assump¬ 
tion  is  made  that  the  levels  of  the 
eight  variables  discussed  in  section  1 
are  randomly  chosen  and  representative 
of  what  would  occur  in  actual  combat. 

A  logarithmic  transformation  of 
equation  (1)  yields 

t  (log  (1-P)]  =  log  [1-Pd(t)  ]  (2) 

which  is  linear  in  t.  Fitting  the  trans¬ 
formed  empirical  data  to  this  equation, 
using  least  squares  ,  yields  a  value  of 
.062  for  P. 

Table  1  shows  the  empirical  cumula¬ 
tive  probabilities  of  detection  along 
with  the  theoretical  probabilities 
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TABLE  1 

Probability  of  Detection:  A  G 


Time  of 

Search  (t) 

Observed  Percent  of 

Detections  in  time  (t) 

Expected  Percent  of 
Detections  in  time  (t) 

3 

16 

17 

6 

32 

32 

9 

48 

44 

12 

56 

53 

15 

63 

61 

18 

66 

68 

21 

75 

74 

24 

80 

78 

27 

82 

82 

30 

85 

85 

33 

87 

88 

36 

90 

90 

39 

92 

92 

42 

93 

93 

45 

94 

94 

48 

95 

95 

computed  using  a  single  second  detection 
probability  of  .062.  Data  were  grouped  into 
three  second  increments  to  eliminate  some 
of  the  "noise "  in  the  data.  The  fit  is 
extremely  good. 

ANALYSIS  OF  GROUND-TO-AIR 
DETECTIONS 

The  ground-to-air  detection  data  present 
an  interesting  problem  in  data  truncation. 

We  possess  data  on  time  to  detect  for  those 
trials  in  which  the  ground  elements  detected 
the  helicopter.  All  we  know  about  those 
trials  where  no  detection  occurred  is  that 
the  time  to  detect  for  that  trial  would  have 
been  longer  than  the  total  aircraft  exposure 
time.  Neither  the  use  of  only  those  trials 
in  which  detection  occurred  nor  the  use  of 
all  trials  is  satisfactory  for  obtaining  a 
cumulative  probability  curve. 

An  unbiased  estimate  of  the  probability , 
P^(t),  the  helicopter  will  be  detected  in 
time  t  can  be  made.  To  obtain  this  estimate  , 
one  considers  only  those  trials  for  which  the 
total  aircraft  exposure  time  equalled  or 


exceeded  t.  If  the  exposure  time  was 
less  than  t ,  the  trial  is  ignored , 
whether  there  was  a  ground-to-air 
detection  or  not.  A  completely  separate 
estimate  of  Pd(t)  is  made  for  each  time 
of  interest.  The  estimator  is 

Pd(t)  =  D  (t)  (3) 

E  (t) 

where 

E(t)  =  Number  of  trials  for  which 
the  aircraft  was  exposed  for 
time  t  or  greater. 

D(t)  =  Number  of  the  E(t)  trials  in 

which  the  aircraft  was  detect¬ 
ed  by  the  ground  _in  time  t 
or  less. 


Table  2  shows  the  values  of  the  factors 
in  equation  (3)  for  selected  times.  This 
function  need  not  be  monotone  increas¬ 
ing  with  time  due  to  random  variation. 
Under  some  general  conditions  ,  the 
expected  value  of  P^(t)  is  monotone. 
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TABLE  2 

Estimated  Probability  of  Ground-to-Air  Detections 


Time(t) 

#  Aire  raft  exposed 
for  t  seconds:  E(t) 

#  G*A  detections: 
D(t) 

pd(t) 

3 

204 

11 

.05 

6 

204 

25 

.12 

9 

202 

35 

.17 

12 

189 

48 

.25 

15 

167 

47 

.28 

18 

149 

52 

.35 

21 

128 

52 

.41 

24 

104 

44 

.42 

27 

89 

41 

.46 

30 

77 

35 

.46 

33 

65 

34 

.52 

36 

54 

26 

.48 

39 

44 

21 

.48 

42 

35 

17 

.49 

45 

29 

14 

.48 

It  appears  from  Table  2  that  unless  the  air¬ 
craft  is  detected  in  about  the  first  30  seconds 
of  exposure  it  will  not  be  detected  at  all. 
Closer  examination  of  the  data  reveal  two 
interesting  facts. 

a.  There  were  several  (about  15%) 
detections  in  excess  of  30  seconds.  This 
would  appear  to  be  in  direct  contradiction  to 
the  data  shown  in  Table  2 . 

b.  There  is  a  striking  correlation  between 
aircraft  exposure  time  and  expected  time  to 
detect.  This  is  not,  as  one  might  at  first 
think,  caused  by  the  fact  that  a  long  detec¬ 
tion  time  can  only  come  in  a  trial  where 
there  is  a  long  exposure  time.  In  those  trials 
where  the  exposure  time  was  30  seconds  or 
less  the  aircraft  was  detected  about  53%  of 
the  time  (detection  coming  in  the  first  30 
seconds  obviously).  In  those  trials  where 

the  exposure  was  greater  than  30  seconds 
the  aircraft  was  detected  within  30  seconds 
in  only  37%.  This  difference  becomes  even 
more  impressive  when  one  considers  that 
the  searcher  looked  for  at  least  a  full  30 
seconds  in  those  trials  for  which  exposure 
exceeded  30  seconds,  and  had  only  aircraft 


exposure  time  in  which  to  search  in  the 
shorter  trials .  This  correlation  explains 
the  '’flattening”  of  the  ground-to-air 
detection  curve  for  t  greater  than  30 
seconds . 

Fitting  the  same  theoretical  curve 
described  in  section  4  above  to  the  data 
shown  in  the  last  column  of  Table  2 
yields  a  poor  fit,  as  expected.  At  mid¬ 
times  ,  around  21  seconds  ,  the  curve  is 
about  10  units  below  the  empirical  and 
at  45  seconds  it  is  about  10  units  above. 
Using  data  from  only  the  first  27  seconds 
for  fitting  the  curve  ,  an  excellent  fit 
is  obtained.  The  empirical  and  theoreti¬ 
cal  curves  differ  by  at  most  two  per¬ 
centage  points.  The  single  second  prob¬ 
ability  of  detection  thus  calculated  is 
.023.  Table  3  shows  the  ground-to-air 
theoretical  cumulative  probabilities  of 
detection. 

SUMMARY 

It  is  felt  that  the  data  collected 
in  these  experiments  are  valid  in 
estimating  detection  probabilities  for 
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TABLE  3 


Probability  of  Detection:  G-A 


Time  of 
Search 

Observed  %  of  detec¬ 
tions  in  time  t 

Expected  %  detec¬ 
tions  using  all  data 
for  fitting  curve 

Expected  %  detections 
using  only  27  seconds 
of  data  for  fitting 
curve 

3 

5 

5 

7 

6 

12 

10 

13 

9 

17 

15 

19 

12 

25 

19 

24 

15 

28 

24 

30 

18 

35 

28 

34 

21 

41 

31 

39 

24 

42 

35 

43 

27 

46 

38 

47 

30 

46 

42 

33 

52 

45 

36 

48 

48 

39 

48 

50 

42 

49 

53 

45 

48 

55 

random  exposures  between  1  and  3  kilometers . 
Both  air-to-ground  and  ground-to-air  detec¬ 
tion  times  fit  the  theoretical  curves  very 
well.  Using  the  single  second  air-to-ground 
and  ground-to-air  detection  probabilities  , 

.062  and  .023  ,  probabilities  of  first  detection 
were  calculated.  Theoretically,  the  air 
should  detect  first  72%  of  the  time,  the 
ground  first  26%  and  there  should  be  2%  ties 
(detection  occurring  the  same  second).  In 
the  experiment  ,the  air  detected  first  68% 
of  the  time,  the  ground  28%  and  4%  were 
ties .  This  adds  further  to  the  validity  of 
the  results  of  the  analysis . 
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ABSTRACT 

Various  measures  of  observer  performance  at  the  task  of  finding  and  recognizing  targets  are  dis¬ 
cussed  along  with  consideration  of  the  type  of  behavior  required  to  do  well  on  each  measure.  It 
is  shown  that  to  do  well  by  different  criteria  requires  different  observer  behavior.  Experimental 
data  is  given  which  compares  observers  on  different  performance  measures  to  illustrate  the 
problem  of  selecting  the  most  proficient  observers.  Conclusions  are  drawn  about  observer  se¬ 
lection  and  its  impact  upon  reconnaissance/strike  systems  effectiveness. 


In  a  reconnaissance,  strike,  or  reconnais¬ 
sance/strike  system  there  is  a  complex  of 
equipment  and  men.  It  appears  reasonable  to 
assume  that  maximum  effectiveness  of  the 
man-machine  system  would  be  obtained  by 
utilizing  the  best  men  and  the  best  machines. 
However,  the  best  machines  may  be  too  expen¬ 
sive  or  unavailable  for  other  reasons,  and  fre¬ 
quently  one  does  not  know  who  of  a  group  of 
available  men  is  the  best.  How  does  one  select 
men  and  how  does  one  select  an  affordable  con¬ 
figuration  of  equipment?  To  what  extent  can 
quality  of  men  make  up  for  deficiencies  in 
machines,  and  vice  versa?  The  answers  to 
such  questions  are  extremely  complex  and  are 
far  from  complete  at  the  present  time.  How¬ 
ever,  it  is  certain  that  to  arrive  at  answers 
one  will  have  to  deal  with  mission-relevant 
measures  of  human  performance,  equipment 
performance,  and  overall  or  system  effective¬ 
ness. 


Characteristics  of  the  men,  of  the  equip¬ 
ment,  and  of  the  situation  or  mission  inter¬ 
act,  so  that  variations  in  equipment  and/or 
situation  will  have  an  effect  on  the  men  in  the 
system.  This  is  saying  that  a  systems  ap¬ 
proach  is  essential  when  dealing  with  a  sys¬ 
tem.  Supermen  and  supermachines  are 
futile  unless  they  are  matched  to  each  others 
characteristics  and  to  the  mission,  and  there 
are  no  universal  systems. 

A  corollary  is  that  there  are  no  universal 
men,  that  is  men  who  are  consistently  the 
best  by  all  performance  criteria.  It  will  be¬ 
come  obvious  why  this  is  true  when,  later  in 
this  paper,  mission-relevant  performance 
criteria  are  examined.  What  are  these 
criteria? 

The  first  to  come  to  mind  is  the  number 
and  percentage  of  targets  that  are  detected, 
with  the  latter  having  more  generality.  An 
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ideal  observer  would  detect,  within  the  allow¬ 
able  time,  all  targets  that  were  displayed  with 
some  minimum,  at  present  unspecifiable, 
image  quality.  A  real  observer  falls  short  of 
the  ideal,  often  by  a  surprising  or  even  shock¬ 
ing  amount.  Pressed  for  or  limited  by  time, 
using  improper  search  procedures,  etc.,  he 
often  misses  targets  displayed  with  excellent 
image  quality  and  satisfactory  image  size.  To 
obtain  a  high  %D  score,  an  observer  may  be 
forced  to  label  many  images  of  marginal  res¬ 
olution  and/or  of  doubtful  identity,  "targets". 
He  will  make  mistakes. 

A  measure  of  mistakes  is  "accuracy": 

The  percentage  of  the  objects  that  he  desig¬ 
nates  as  targets  that  truly  are  targets.  An 
information-equivalent  measure  is  percentage 
of  false  positives,  i.e. ,  objects  mistaken  for 
targets.  An  ideal  observer  would  make  no 
mistakes:  His  accuracy  would  be  100%.  A 
real  observer  can't  do  this  because  real  sen¬ 
sors  in  the  real  world  don't  display  all  targets 
with  ideal  contrast  and  resolution.  Often 
some  nontargets  look  more  like  real  targets 
than  do  some  targets.  An  observer  who  wants 
to  obtain  a  high  accuracy  score  has  to  be  very 
sure  of  what  he  calls  targets:  He  cannot  call 
many  marginal  signatures  targets  without 
making  errors.  Thus,  to  score  high  on  accu¬ 
racy  he  has  to  be  very  cautious,  while  a  high 
%D  score  is  obtained  by  being  very  reckless. 

In  weapon  delivery  situations  it  is  usually 
advantageous  to  detect  and  recognize  targets 
while  they  are  still  at  long  slant  ranges.  Some 
weapon  systems  even  require  weapon  release 
at  long  range.  Since  the  slant  range  is  usually 
decreasing  rapidly  due  to  motion  of  the  air¬ 
craft,  detection  and  recognition  at  long  slant 
ranges  is  achieved  by  quick  acquisition:  Short 
reaction  or  detection  times  are  required. 
However,  at  long  ranges  target  images  on  the 
display  are  small  and  poorly  resolved.  To 
obtain  a  large  target  acquisition  slant  range 
score,  i.e.,  quick  detection,  requires  reck¬ 
less  behavior.  If,  to  avoid  mistakes,  one 
waits  until  the  displayed  image  is  large  and 
clearly  resolved,  then  one  will  not  obtain  a 
"good"  reaction  time  score.  To  obtain  an 
excellent  time  score  also  requires  ignoring 


targets  first  seen  at  close  range  and  making 
snap  judgments  amounting  to  guesses  on  small 
target  images  at  the  top  of  the  display.  Thus, 
a  good  reaction  time  score  is  obtained  at  the 
cost  of  a  low  %D  score  and  a  low  accuracy 
score. 

Tabic  1  summarizes  the  behavior  re¬ 
quired  for  high  scores  on  each  of  the  three 
measures  of  performance.  The  table  and  the 
above  discussion  make  it  clear  that  one  cannot 
do  extremely  well  on  all  three  performance 
measures  because  the  types  of  behavior  re¬ 
quired  to  do  so  are  incompatible  or  contra¬ 
dictory.  The  ideal  observer  would  detect  all 
of  the  targets  (%D  =  100%)  ,  mistake  no  non¬ 
targets  for  targets  (accuracy  =  100%) ,  and 
would  detect  and  recognize  targets  the  instant 
that  their  images  appeared  on  the  display 
(reaction  time  =  0) .  A  real  observer  cannot 
even  come  close  to  such  behavior. 

All  three  of  the  performance  measures 
(%D,  accuracy,  and  speed)  are  important.  In 
a  particular  situation  (or  mission)  the  best 
men  to  use  would  be  those  who  do  best  on  the 
criteria  of  most  importance  in  the  particular 
situation.  More  accurately,  men  should  be 
selected  whose  abilities  on  various  measures 
of  performance  form  the  best  match  with  the 
performance  requirements  of  the  task.  For 
example,  how  well  an  observer  does  on  the 
performance  criteria  depends  upon  both  his 
ability  and  his  emphasis.  It  is  easy  for 
observers  to  vary  the  relative  emphasis  that 
they  give  to  different  aspects  of  performance. 

Thomas  and  Sadacca  (1965)*  demon¬ 
strated  that  image  interpreters  can  vary  their 
performance  when  the  relative  weight  assigned 
to  %D  and  accuracy  is  varied.  The  discus¬ 
sion  in  the  present  paper  makes  it  apparent 
that  to  do  well  on  one  measure  of  performance 
may  mean  poor  performance  on  another. 


*  Thomas,  J.A.,  and  Sadacca,  R.  Ability  of 
Image  Interpreters  to  Adapt  Output  to  Varying 

Requirements  for  Completeness  and  Accuracy, 

USA  PRO  Technical  Research  Note  165, 
December  1965. 
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TABLE  I.. 


Performance  Measure 
or  Score 

Behavior  Required  to  Obtain  a  Very  Good  Score 

%D 

Label  every  possible  image  a  target:  Behave  recklessly. 
Don’t  watch  a  possible  target  move  down  the  display, 
growing  in  image  size  and  resolution,  before  deciding, 
because  this  takes  time  that  could  be  devoted  to  search¬ 
ing  for  other  targets.  Thus,  quick  reaction  is  important. 

Accuracy 

Be  very  cautious  or  conservative:  Be  sure  it  is  a  target 
before  calling  it  one.  Let  its  image  move  way  down  the 
display,  becoming  large  and  clear,  before  reacting: 

Take  your  time,  better  be  slow  than  wrong. 

Reaction  Time  or 

Slant  Range 

React  Quickly:  Don't  wait  for  better  resolution :  Detect 
and  recognize  targets  at  the  top  of  the  display.  Don’t 
waste  time  searching  the  middle  or  bottom  of  the  disolay, 
as  targets  acquired  there  are  close  and  yield  long  reac¬ 
tion  times.  Reckless  behavior  pays  off  in  long  detection 
ranges. 

It  is  clear  that  measuring  even  highly  trained 
men  for  ability  so  that  the  best  man  can  be  se¬ 
lected  is  not  a  simple  matter.  It  would  not  be 
either  simple  or  easy  even  if,  for  particular 
missions,  one  were  given  realistic  "weights" 
for  each  of  the  three  types  of  performance 
measures. 

Despite  the  obvious  difficulties  in  deter¬ 
mining  who  is  "best",  it  is  important  that  it  be 
done.  This  is  true  because,  with  either  un¬ 
trained  or  highly  trained  observers  it  is  com¬ 
mon  to  find  some  individuals  who  are  far  supe¬ 
rior  to  others  on  any  performance  measure 
that  one  may  select.  Differences  of  50% to 
200% are  common,  and  even  larger  differences 
are  not  rare.  To  obtain  such  large  differences 
in  the  performance  of  man-machine  systems 
by  using  superior  sensors,  for  example,  could 
require  vastly  more  expensive  equipment.  For 
some  state-of-the-art  systems  no  amount  of 
expended  funds  on  research,  development,  and 
procurement  could  quickly  (or  even  not  so 
quickly)  upgrade  system  performance  as  much 
as  could  more  efficient  observers. 

Prior  discussion  has  been  on  theory,  but 


it  is  easy  to  illustrate  some  of  the  main 
points  by  examination  of  data  from  empirical 
studies.  Three  such  studies  by  the  author 
will  be  used.  In  the  first  study,  fourteen 
observers  had  to  find  and  recognize  unbriefed 
targets  on  a  rear-projection  screen  display  of 
motion  pictures  which  had  been  taken  from  an 
aircraft  at  a  340-foot  altitude.  The  forward- 
looking  camera  had  a  field-of-view  extending 
from  a  near  slant  range  of  1950  feet  to  a  far 
slant  range  of  4910  feet.  At  a  simulated  air¬ 
craft  speed  of  only  60  knots  they  had  to  find 
targets  of  opportunity:  men,  cars,  trucks, 
and  heavy  construction  equipment.  The  mean 
scores  for  the  observers  and  their  rankings 
or  relative  positions  on  performance  are 
given  in  Table  II. 

On  %D,  the  best-to-worst  score  ratio 
was  76. 54%/50. 12%,  or  1.53;  the  top  observer 
on  %D  detected  and  recognized  53%  more  tar¬ 
gets  than  did  the  bottom  man.  The  ratio  for 
accuracy  was  91.25/46.49,  or  1.96.  On 
reaction  time  (detection  +  recognition)  the 
average  overall  targets  ranged  from  8.62 
seconds  to  15.90  seconds,  a  ratio  of  1.84. 

On  screen  position  the  ratio  was  62.76/37.26, 
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or  1.68.  Considerable  range  in  scores  is 
shown  by  the  size  of  the  standard  deviations  at 
the  bottom  of  the  table. 

The  rank  or  relative  position  of  individual 
observers  is  of  some  interest  since  it  permits 
a  quick  grasp  of  the  relative  standing  of  indivi¬ 
dual  observers  across  two  or  more  measures 
of  performance.  Table  II  shows  that  no  ob¬ 
server  was  number  one  (best)  on  more  than 


one  measure  and  only  one  person,  "M",  was 
number  14  (worst)  on  as  many  as  two  meas¬ 
ures.  Looking  at  the  last,  or  mean  rank 
column,  it  can  be  seen  that  individuals  "F" 
and  uG!f  do  relatively  well  on  an  overall 
basis.  Thus,  despite  the  incompatibility  in¬ 
herent  in  obtaining  high  scores  on  every  per¬ 
formance  measure,  a  few  observers  did 
relatively  well  overall. 


TABLE  II 

OBSERVER  PERFORMANCE  WITH  LOW  ALTITUDE  MOTION  PICTURES 
SIMULATING  AN  AIRCRAFT  SPEED  OF  60  KNOTS 


%D 

Accuracy 

Reaction 

Time 

Screen 

Position  + 

Mean++ 

Observer 

Score 

Rank 

Score 

Rank 

Score 

Rank 

Score 

Rank 

Rank 

A 

58.02 

8 

60.76 

12 

10.18 

8 

41.59 

6 

8.50 

B 

68.15 

4 

67.22 

7 

8.75 

4 

35.16 

7 

6.00 

C 

58.26 

7 

58.95 

13 

11.98 

11 

51.65 

10 

10.25 

D 

56.47 

10 

75.61 

5 

9.95 

7 

37.26 

1 

5.75 

E 

50.85 

13 

66.35 

10 

10.65 

9 

39.78 

4 

9.00 

F 

65.39 

6 

80.52 

4 

8.62 

1 

39.47 

5 

4.00 

G 

76.54 

1 

74.97 

6 

8.65 

2.5 

35.34 

2 

2.88 

H 

54.09 

11 

72.31 

7 

15.58 

13 

63.57 

13 

11.00 

I 

68.74 

3 

90.38 

2 

14.80 

12 

62.26 

12 

7.25 

J 

57.90 

9 

64.33 

11 

8.65 

2.5 

35.92 

3 

6.38 

K 

70.77 

2 

46.49 

14 

9.70 

5 

41.37 

9 

7.50 

L 

50. 12 

14 

91.25 

1 

9.90 

6 

37.71 

8 

7.25 

M 

52.94 

12 

70.59 

8 

15.90 

14 

62.76 

14 

12.00 

N 

68.02 

5 

84.78 

3 

11.40 

10 

50.23 

11 

7.25 

Mean 

S.D.  ** 

61.16 

8.31 

71.75 

12.49 

11.04 

2.59 

45.29 

10.73 

The  scores  are  the  means  of  the  separate  scores  on  four  types  of  targets:  people,  cars, 
trucks,  and  heavy  construction  equipment. 

**  Standard  deviation  of  scores,  a  measure  of  scatter  or  individual  differences. 

+  Screen  position  is  the  distance  down  from  the  top  of  the  display  when  the  observer 
designated  an  object  as  a  target. 

++  Mean  rank  is  the  average  of  the  four  ranks. 
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The  nature  of  the  relationship  between  test 
scores  on  the  various  measures  of  performance 
becomes  apparent  when  they  are  plotted 
against  each  other,  two  at  a  time,  in  scatter 
plots  or  diagrams.  The  six  possible  diagrams, 
with  ranks  plotted  against  each  other,  along 
with  the  rank  correlation  coefficient,  for  each 
set  are  given  in  Figure  1.  Note  that  four  of  the 
six  coefficients  are  less  than  .1,  i.e., 
essentially  zero:  %D  -  accuracy,  %D  -  screen 
position,  accuracy-reaction  time,  and 
accuracy-screen  position.  Even  the  %D  - 
reaction  time  correlation  coefficient  is  a  low 
.3462,  which  is  significantly  different  from  0 
(tl2  =  3.69)  ,  but  too  small  to  be  valuable  in 
predicting  one  score  from  the  other.  The  r' 
of  .7525  for  reaction  time-screen  position  at 
detection  is  not  very  high.  It  is  largely  due  to 
the  bulk  of  the  targets  moving  straight  down 
the  display  at  a  uniform  rate  until  detected  so 
that  for  most  targets,  one  measure  is  directly 
convertible  to  the  other  by  a  simple  linear 
equation. 

The  story  told  by  the  values  of  the  rank 
correlation  coefficients  is  obvious  from  the 
scatterplots:  instead  of  lying  on  a  straight 
line  or  in  a  narrow  ellipse,  which  would  indi¬ 
cate  a  high  degree  of  relationship,  individuals 
are  widely  scattered.  Only  individuals  "F" 
and  "G"  are  in  the  lower  left  corner,  indicat¬ 
ing  good  scores,  on  all  graphs.  Only  indivi¬ 
duals  H  (5  out  of  6  plots)  and  M  (4  out  of  6 
plots)  are  fairly  consistently  in  the  upper 
right  corner,  indicating  inferior  performance. 

A  somewhat  similar  lack  of  all-around 
excellence  in  observer  performance  appears 
in  a  second  study  by  the  author.  In  this  study 
twenty  SAC  radar  navigators  had  to  find  un¬ 
briefed  targets  on  a  rear-projection  screen 
display  of  side-looking  radar  imagery.  An 
image  of  a  strip  of  territory  41. 5  nautical 
miles  wide  and  500  nautical  miles  long  moved 
down  the  display  at  a  rate  which  simulated  an 
aircraft  speed  of  700  knots.  Images  required 
215  seconds  to  move  from  the  top  of  the  dis¬ 
play  to  the  bottom,  so  that  even  at  700  knots 
observers  were  not  rushed  for  time. 

The  basic  performance  data  is  given  in 


Table  III.  Note  that  the  percentage  of  targets 
detected  varied  from  a  low  of  13.  l%to  a  high 
of  38.5%,  accuracy  varied  from  35. 29% to 
10.34%,  and  screen  position  varied  from  1.9 
to  7,  2  elevenths  of  the  way  down  the  display. 
Thus,  on  any  of  the  three  performance 
measures  there  was  a  ratio  of  about  three  or 
greater  between  the  highest  and  lowest 
scores  . 

The  scatter  plots  of  the  ranks  of  the  SAC 
observers  are  shown  in  Figure  2.  Note  that 
individual  "C"  is  rather  poor  on  all  measures 
while  observer  "J"  does  not  do  badly  on  any 
measure.  However,  no  person  has  a  rank  of 
lower  (better)  than  7  on  all  performance 
measures:  No  observer  is  superior  on  all 
three  performance  measures. 

The  last  study  used  for  illustrative  data 
used  low  altitude  strip  photography  displayed 
at  slow  helicopter  speeds:  4.2  knots  for  one 
group  of  observers  and  10.5  knots  for 
another.  Targets  were  manhole  covers  on 
streets  and  garbage  cans  in  yards  and  alleys. 
Table  IV  gives  the  data  %D  was  about  2:1 
from  best  to  worst  observer  at  both  of  the 
speeds,  accuracy  in  both  was  about  1. 2  to  1, 
and  the  reaction  time  ratio  was  25.3/5.8,  or 
4.4:1  at  the  slowest  speed  and  10.  5/3.  9  or 
2,  7 :1  at  the  10. 5  knot  speed.  Thus,  on%D 
and  time  (or  target  acquisition  range) ,  the 
range  of  individual  differences  was  large, 
while  all  observers  were  rather  close 
together  on  accuracy.  Look  at  the  six 
scatter  plots  in  Figure  3,  with  particular 
attention  to  the  lower  left  hand  corners 
(  "good"  performance)  and  the  upper  right 
hand  corners  (bad  performance)  .  It  may  be 
noted  that  the  same  individuals  don't  occupy 
the  same  corners  on  all  three  plots  at  each 
speed.  Few  individuals  are  really  close  to 
the  corner  on  even  two  plots.  Thus,  once 
again,  it  can  be  concluded  that  observers 
were  not  uniformly  good  or  bad.  Since  none 
of  the  correlation  coefficients  were  larger 
in  magnitude  than  .5121,  it  follows  that 
rank  on  any  one  performance  measure  has 
little  predictive  value  for  rank  on  another. 
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Rank  on  Accuracy  Rank  on  Reaction  Time  Rank  on  Screen  Position 


Rank  on  Reaction  Time 


56  789  10  111213  14 
Rank  on  Accuracy 


Rank  on  Accuracy 

Figure  1.  Scatter  Plots  of  Performance  Rankings  of  Ten  Observers  Using  Motion 
Picture  Imagery. 
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TABLE  III 


PERFORMANCE  OF  SAC  RADAR  NAVIGATORS  WITH  SLAR*  IMAGERY 
AT  A  SIMULATED  AIRCRAFT  SPEED  OF  700  KNOTS 


Performance  Measure 

Mean 

Rank 

Observer 

%D 

Rank 

Accuraey+ 

Rank 

Screen 

Position++ 

Rank 

A 

33.1 

5 

21.51 

15 

|  2.650 

5 

8.33 

B 

22.5 

10 

26.05 

9 

3.412 

12 

10.33 

C 

15.9 

18 

10.34 

20 

3.917 

14 

17.33 

D 

38.5 

1 

14.44 

18 

4.185 

15 

11.33 

E 

30.8 

6 

14.04 

19 

2.167 

2 

9.00 

F 

21.8 

11 

31.48 

4 

4.588 

18 

11.00 

G 

17.9 

16.5 

35.00 

2 

4.357 

17 

11.83 

H 

19.2 

14.5 

26.79 

7 

4.200 

16 

12.50 

I 

19.2 

14.5 

20.83 

16 

2.533 

4 

11.50 

J 

26.9 

7 

34.43 

3 

3.143 

10 

6.67 

K 

13.1 

20 

24.39 

12 

j  2.800 

7 

13.00 

i  | 

L 

38.5 

2 

22.06 

14 

3.000 

9 

8.33  | 

M 

37.2 

3.5 

21.14 

13 

2.759 

6 

7.50 

N 

20.5 

12.5 

25.00 

11 

7.250 

20 

14.50 

O 

37.2 

3.5 

,  20.42 

17 

1.923 

1 

7.17 

P 

24.7 

8 

26.76 

8 

2.316 

3 

6.33 

Q 

20.5 

12.5 

31.37 

5 

2.875 

8 

8.50 

j 

R 

15.4 

19 

,  27.91 

6 

3.167 

11 

12.00 

S 

23. 1 

9  , 

n  35.29 

1 

3.667 

13 

7.67 

T 

17.  9 

16.5 

25.45 

10 

6.571 

19 

15.17 

Mean 

24.70 

24.74 

10.50 

3.574 

10.50 

10.50 

S.D. 

8.30 

6.93 

5.95 

1.371 

5.95 

3.05 

*  SLAR  =  Side- Looking  Airborne  Radar. 

+  Accuracy  =  (100)  X  (Number  of  Targets  Detected) /(Detections  +  False  Positives)  = 
(100)  X  (Number  Correct) /(Number  of  Responses)  =  %Correct. 

++  Screen  position  in  elevenths  of  the  way  down  the  14-inch  display.  Since  215  seconds 
was  required  to  move  from  top  to  bottom,  each  eleventh  represents  19.5  seconds. 
NOTE:  Targets  were  unbriefed  dams,  railroad  yards,  industry,  tank  farms  and  airfields. 
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Rank  on  Accuracy  Rank  on  Reaction  Time  Rank  on  Reaction  Time 
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Figure  3.  Scatter  Plots  for  Ranks  on  Performance  Measures  for  Observers 
Using  Strip  Photography. 
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Rank  on  Accuracy 


Rank  on  %D 


Figure  2.  Scatter  Plots  of  Performance  Rankings  of  Twenty  SAC  Radar  Navigators 
Using  Side-Looking  Airborne  Radar. 
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NOTE  (1)  :  Observers  were  arranged  in  order  on  %D  scores  before  being  assigned  letter  designations  in  the  above  table. 
NOTE  (2)  :  Targets  were  unbriefed  manhole  covers  in  streets  and  garbage  cans. 


SUMMARY  AND  CONCLUSIONS 


1 .  The  type  of  behavior  required  for 
doing  well  according  to  one  performance 
measure  may  be  incompatible  with  that  re¬ 
quired  for  doing  well  on  another. 

2.  Examination  of  data  from  human  fac¬ 
tors  studies  reveal  that  observers  differ 
widely  on  performance  scores,  a  range  of  50% 
to  300%  not  being  unusual  on  any  one  type  of 
measurement. 

3.  There  are  very  few,  if  any,  observers 
who  rank  very  high  (or  very  low)  on  all  three 
of  the  most  obvious  performance  measures: 
percent  of  targets  detected,  accuracy,  and 
speed  (time  or  slant  range  or  screen  position 
at  detection)  .  A  few  individuals  do  well  on 
two  of  the  three  measures,  but  not  extremely 
well  on  both. 

4.  The  correlation  coefficients  between 


scores  on  different  combinations  of  the  three 
main  criteria  are  usually  too  low  to  be  of 
practical  use:  Prediction  of  scores  on  one 
measure  from  scores  on  another  is  practi¬ 
cally  not  possible. 

5.  These  facts  indicate  that  the  ob¬ 
server  selection  problem  is  important  but 
very  difficult.  Theory  and  data  both  indicate 
that  one  must  select  for  particular  missions 
or  purposes,  rather  than  for  general  pur¬ 
poses:  Few,  if  any,  observers  are  "tops" 

in  everything.  Effective  selection  devices 
are  needed. 

6.  More  research  is  needed,  particu¬ 
larly  on  the  absolute  and  relative  scores  of 
the  same  observers  in  different  situations 
and  with  instructions  giving  different 
emphasis  on  the  relative  importance  of  %D, 
accuracy,  and  acquisition  range. 


The  research  reported  in  this  paper  was  conducted  by  personnel  of  the  Aerospace  Medical 
Research  Laboratory,  Aerospace  Medical  Division,  Air  Force  Systems  Command,  Wright- 
Patterson  Air  Force  Base,  Ohio.  This  paper  has  been  identified  by  Aerospace  Medical 
Research  Laboratory  as  AMR L-TR -72-86.  Further  reproduction  is  authorized  to  satisfy 
needs  of  the  U.  S.  Government. 
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THE  MYTHOLOGY  OF  TARGET  ACQUISITION  SYSTEM  DESIGN  AND  PERFORMANCE 


Lucien  M.  Biberman 
Institute  for  Defense  Analyses 
Arlington,  Virginia 


Abstract 

This  paper  considers  the  factors  long  associated  with 
human  performance  and  vision  and  contrasts  such  factors 
with  those  that  have  more  recently  been  found  to  have 
greater  relevance.  In  particular,  a  brief  review  is 
made  of  past  experiments  and  present  programs  and  com¬ 
pares  and  contrasts  the  important  factors  emerging  from 
each.  A  series  of  examples  are  given  in  both  visual 
and  acoustical  analogs  to  point  up  some  of  the  more 


commonly  believed  mythology. 

This  afternoon  I  shall  concern  my¬ 
self  only  with  those  factors  that  concern 
transfer  of  information  from  an  image  to 
an  observer,  and  shall  concentrate  on  the 
mythology  surrounding  television-type 
imagery . 

All  that  glitters  is  not  gold;  the 
hand  is  quicker  than  the  eye;  and  the 
"sharp"  television  tube  often  interferes 
with  the  viewer Ts  seeing  the  picture  that 
is  really  there.  But  letfs  get  to  that 
a  little  later. 

The  sudden  interest  in  low  light 
level  technology  in  the  early  sixties 
and  particularly  in  the  first  of  the 
intensif ier-image  orthicon  television 
cameras ,  resulted  in  a  flurry  of  field 
evaluations  which  by  and  large  poorly 
displayed  various  pictures  of  various 
"military  targets"  under  various  circum¬ 
stances  with  little  or  no  control  of 
conditions . 

Yet  some  good  work  was  done  in  which 
the  same  levels  of  "resolution"  as  were 
expected  from  low  light  level  television 
were  simulated  by  projected  photographs, 
but  raster  effects  and  noise  character¬ 
istics  were  not  understood  and  thus 


ignored  or  were  badly  simulated.  As  a 
result  no  real  relationship  was  estab¬ 
lished  that  would  predict,  based  upon 
laboratory  trials  or  photographic  simu¬ 
lations,  how  well  a  man  actually  using 
a  television  set  of  a  given  type  would 
perform.  At  this  point,  I  am  quite 
careful  not  to  say  anything  about  the 
relationship  between  system  quality 
and  observer  performance.  The  reason 
is  that  the  television  systems  were 
specified,  ordered,  designed  and  de¬ 
livered  on  the  basis  of  a  number  of 
system  parameters  that  had  little  or 
nothing  to  do  with  image  quality  or 
visual  performance,  i.e.,  the  basic 
factors  that  allowed  people  to  see 
things  on  a  television  display. 

There  were  some  who  attempted 
to  carry  out  human  factors  experi¬ 
ments  with  television  earlier,  but 
some  of  the  first  quantitative  work 
of  interest  to  me  was  carried  out 
by  Ronald  Erickson  and  his  associates 
under  conditions  in  which  television 
images  of  relatively  high  quality 
were  used  to  test  human  ability  to 
perform  visual  tasks. 

Dorothy  Johnson  (1968)  conducted 
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a  series  of  experiments  concerned  with 
target  recognition  as  a  function  of  slant 
range,  horizontal  resolution  and  shades 
of  gray.  The  measured  performance  para¬ 
meters  were  target  recognition  time  and 
probability.  Ms.  Johnson Ts  results 
showed  correlation  between  the  parameters 
studied  . 

Scott,  Hollanda  and  Harabedian 
(1970)  carried  out  a  series  of  simula¬ 
tions  of  TV  imagery  through  the  use  of 
Frank  Scott Ts  line  scan  generator.  This 
work  was  like  that  of  Hemingway  and 
Erickson  on  symbols  except  vehicular 
images  were  used  as  inputs  and  perspective 
was  an  important  but  not  consistent 
parameter.  They  reported  the  identifi¬ 
cation  and  classification  accuracy  of 
noiseless ,  static  line-scan  images  as  a 
function  of  the  number  of  scans  per 
vehicle  (4,  6,  9,  13.5,  20  and  30  scans) 
and  angle  of  view  (nadir  and  45  degree 
obliquity  angle) . 

In  a  related  work,  Gaven  and 
Tavitian  (1970)  study  the  effect  of 
shades  of  gray  and  the  number  of  scan 
lines  on  observer  performance.  They 
studied  the  informative  value  of  sampled 
imagery  as  a  function  of  the  number  of 
sampling  elements  and  the  number  of  bits 
used  to  encode  the  gray  levels  in  the 
imagery . 

The  above  series  of  experiments  are 
not  the  only  work  that  has  been  done,  but 
are  representative  of  a  class  of  psycho¬ 
physical  experiment  that  relates  perform¬ 
ance  to  display  parameters  that  may  in¬ 
deed  not  be  the  independent  variable. 

The  line  scan  generator  of  Scott 
(1967)  produced  a  series  of  line  scanned 
imagery  typically  illustrated  by  the 
few  samples  in  Figs.  1  and  2.  It  is 
obvious  that  there  are  other  more  basic 
factors  that  drive  the  quality  of  line 
scan  imagery  which  produce  the  dependent 
variables  such  as  "lines Tt  and  "gray 
scale . " 

The  raster  lines  also  produce  false 
imagery  caused  by  the  interaction  of  the 
signal  inherent  in  the  raster  line  fre¬ 
quency  with  frequencies  present  in  the 
imagery,  i.e.,  sum  and  difference  effects 
inherent  in  any  sampling  system. 


Though  the  effect  of  the  sampling 
problem  has  been  explained--f irst  in 
the  middle  thirties  and  again  in  the 
fifties — it  is  coming  up  again  in 
the  seventies  as  serve ral  whole  new 
families  of  flat  displays  or  picture 
tubes  emerge.  These  new  displays 
like  those  in  small  computers,  cash 
registers  and  the  like  are  composed  of 
a  large  number  of  individual  elements 
that  glow  more  or  less  brightly  to 
form  a  picture.  They  are  displays 
made  of  many  discrete  elements  and 
differ  from  usual  images  in  the  way 
that  newsprint  pictures  differ  from 
good  photographs .  That  is  ,  the 
images  are  sampled  rather  than  con¬ 
tinuous  and  sampling  can  produce  lots 
of  problems  whether  the  sampling  is 
only  in  one  direction  like  the  lines 
in  a  television  picture  or  in  two 
dimensions  like  the  dots  in  a  news¬ 
print  picture.  The  more  distinct 
and  sharp  these  lines  or  dots  are, 
the  more  trouble  they  cause.  Also 
the  farther  apart  or  the  coarser 
the  pattern,  the  worse  the  troubles. 
This  is  obvious,  see  figures  on 
raster  and  Moire  effects,  Figs,  3 
and  4,  but  it  is  really  a  small  part 
of  the  whole  story. 

These  factors  were  predicted 
before  the  birth  of  television  as  we 
now  know  it.  The  difficulties  due 
to  sampling  in  a  series  of  discon¬ 
tinuous  lines  were  clearly  set  forth. 

Mertz  and  Gray  in  1934  predicted 
the  raster  effect  on  image  quality. 
Twenty  years  later,  after  patenting 
means  to  remove  some  of  the  injurious 
effects  predicted  by  Mertz  and  Gray, 
Otto  Schade  (1953)  published  a  com¬ 
plete  quantitative  treatment  supple¬ 
menting  the  paper  of  Mertz  and  Gray. 

Four  years  after  Schade ?s  paper, 
F.  T.  Thompson  (1957)  reported  on 
personal  preferences  in  viewing  tele¬ 
vision  imagery  and  the  effect  that 
removing  raster  structure  had  upon 
these  preferences. 

Thompson  describes  a  variety  of 
means  for  accomplishing  a  reduction 
in  the  appearance  of  individual  lines. 
He  conducted  a  series  of  experiments 
with  various  raster  frequencies.  The 
results  strongly  confirm  each  other 
and  thus  we  quote  only  his  results. 
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Figure  1 .  Line-Scan  Image  Made  With 
LSIG  in  Electronic  Mode  Illustrating 
Low  Spatial- Frequency  Attenuation 
(1.50  scans  per  mm) 


Figure  2.  Line-Scan  Image 
LSIG  in  Electronic  Mode  — 
Noise  Added  (1.5  scans 


Made  With 
Gaussian 
per  mm) 
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Figure  4a.  A  Concentric  Pattern  for 
Moire  Experiments 


Figure  3c.  Yields  This 
Image 


False  Images  Resulting  from  Aliasing 
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"It  would  be  desirable  to  suppress 
or  eliminate  the  line  structure  in 
television  pictures  thereby  allow¬ 
ing  the  larger  receivers  to  be  com¬ 
fortably  viewed  at  reduced  distances." 

Thompson  thus  indicated  that  the 
viewers  moved  away  from  the  television 
set  until  they  could  no  longer  see  the 
raster  lines  (or  the  equivalent  fine 
image  structure  along  a  line).  When  the 
raster  was  removed,  the  viewers  moved 
back  to  the  same  shorter  distance  they 
chose  to  observe  raster-free  imagery  and 
thus  could  observe  greater  detail.  These 
effects  are  shown  in  Figs.  5  and  6  from 
Thompson  T s  paper . 


Figure  5.  Observers  Sitting  Closer  Than  a 
Given  Distance  (525  Line  Raster 
on  24  Inch  Receiver) 


VIEWING  DISTANCE,  SPOT  WO— LE  PICTU«£ 

VIEWING  DISTANCE,  CONVENTIONAL  fICTUtE 


Figure  6.  Effect  on  Spot  Wobble  on 
Viewing  Distance  (525  Line  Raster) 


Fifteen  years  after  Thompson Ts 
experiment  and  eighteen  years  after 
SchadeTs  SMPTE  paper,  there  still  is 
a  determined  effort  to  get  more  infor¬ 
mation  to  pilots  in  aircraft  through 
television  or  similar  image-forming 
equipment.  Though  Schade  has  done  the 
calculations  and  Thompson  has  done  an 
irrefutable  experiment,  system  designers 
still  strive  for  greater  amounts  of 
"limiting  resolution"  in  camera  tubes 
and  ignore  the  effect  of  the  raster  in 
the  camera  tube  and  especially  in  the 
display. 

In  1969,  in  search  of  a  new  do¬ 
mestic  television  set,  I  read  ratings, 
talked  to  knowledgeable  colleagues  and 
went  out  to  shop  for  a  color  television 
receiver.  In  the  course  of  visiting  a 
number  of  stores,  I  was  strongly  im¬ 
pressed  by  the  actions  and  apparent 
criteria  of  many  other  shoppers .  A 
surprisingly  large  number  of  prospec¬ 
tive  buyers  got  up  close  to  the  picture 
tube  and  examined  the  raster  structure. 
Most  seemed  very  pleased  with  pictures 
in  which  the  "lines"  were  sharp  and 
clean.  I  intruded  to  ask  a  number  of 
buyers  so  engaged  how  they  were  making 
their  choice  since  they  obviously  were 
making  a  close  technical  (if  subjective) 
comparison.  Flattered  at  my  recognition 
of  them  as  men  of  discrimination  and 
sound  judgement,  they  invariably  pointed 
out  to  me  that  they  were  looking  for  a 
set  that  was  in  good  adjustment  and 
sharp  focus — "just  look  at  how  sharp 
the  lines  are, --real  good  detail  in 
that  set. " 

Now  let  me  show  you  how  silly  this 
criterion  really  is.  Figure  J  shows  a 
picture  of  a  map  as  seen  on  a  two  dimen¬ 
sional  matrix  display.  The  whole  picture 
is  quite  good.  It's  just  hard  to  read 
detail. 


Figure  7.  Map  -  Two  Dimensional 
Matrix  Display 
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The  second  figure  is  a  good  magni¬ 
fied  picture  of  a  small  section.  There 
is  no  more  detail  because  the  raster 
gets  in  the  way.  If  I  defocus  the  pic¬ 
ture  and  let  everything  get  blurry,  the 
fine  raster  lines  get  blurry  faster  than 
the  larger  type  and  we  can  read  more 
than  we  previously  could  with  a  sharp 
picture.  (See  Figs.  8  and  9.) 


Figure  8a.  One  Dimensional  Raster 
Imagery  Buried  in  Raster 


Well-designed  displays  should 
never  have  a  raster  pattern  that  is  vis¬ 
ible  from  a  normal  viewing  distance. 

The  displays  we  get  will  continue  to  be 
poor  as  long  as  we  don’t  understand  these 
facts  and  are  willing  to  buy  television 
sets  that  mutilate  imagery  with  sharp 
black  and  white  lines.  One  must  be  sure 
that  the  set  has  fuzzy  lines  that  are 
almost  invisible  but  a  sharp  picture, 

NOT  vice  versa. 

At  the  risk  of  becoming  somewhat 
less  than  accurate  let  me  explain  that 
the  effect  of  the  sharp  lines  in  a  pic¬ 
ture  are  somewhat  like  a  piercing  whis¬ 
tle  that  bothers  us  when  we  are  trying 
to  listen  to  code  on  a  radio. 

Remove  the  piercing  scream  of  the 
whistle  and  the  soft  code  can  be  heard 
distinctly.  In  the  same  way  remove 
the  "loud"  black  and  white  lines,  or 
make  them  softer  to  be  able  to  "hear" 
the  "softer"  images  on  the  tube. 

As  the  new  displays  come  into  the 
development  cycle  it  is  most  important 
that  the  line  structure  be  minimized 
in  order  to  maximize  the  usefulness 
of  the  new  displays  to  the  human  viewer. 


Figure  8b.  Two  Dimensional  Raster 
Imagery  Buried  in  Raster 
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Figure  9.  Defocussing  Blurs  Raster 
Faster  Than  Type 

Defocussing  loses  information,  but  loses 
it  faster  at  high  frequencies;  thus,  de¬ 
focussing  loses  raster  before  it  loses 
the  larger  type  on  the  maps. 


We  have  said  necessary  but  nasty 
things  about  lines  in  sampled  imagery. 
We  feel  that  the  previous  experiments 
overlooked  the  fact  that  the  raster 
lines  need  not,  and  should  not,  be 
visible. 

Thus,  though  we  do  not  disagree 
with  the  conclusions  drawn  from  the 
psychophysical  television  experiments 
previously  noted,  we  would  like  to 
point  out  that  by  changing  the  proper¬ 
ties  of  the  spot  sizes  of  "apertures" 
used  with  a  display  raster,  but  not  the 
raster  frequency  or  basic  form,  the 
results  would  be  strongly  affected. 

Actually,  one  must  conclude  that  the 
angular  subtense  and  number  of  raster 
lines  do  affect  the  legibility  of 
symbols,  however,  other  primary,  more 
basic  factors  of  the  raster  can  have 
equal  or  greater  effect  and  thus  the 
raster  parameters  must  be  specified. 

Now,  let  me  show  a  few  slides  and 
perform  a  few  fun  experiments  with  you. 
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First  let  me  show  the  statisti¬ 
cal  aspects  of  video  communications 
in  a  slide  made,  I  believe,  by 
George  Morton  for  his  magnificent 
article  of  Applied  Optics  a  few  years 
back. 

Figure  10  shows  the  resulting 
effect  on  information  content  in  an 
image  caused  by  limiting  the  number 
of  bits  of  light  or  photons  making 
up  the  image.  It’s  a  beautiful 
example  of  Poisson  statistics. 

Next  let  me  show  you  the  effect 
of  too  much  of  a  good  thing- -more 
light--and  what  modulation  in  imagery 
means  to  you. 


3.6  x  10  Photons 


Look  at  the  next  series  of  fig¬ 
ures  ,  Fig.  11,  same  as  before  but 
now  let  me  throw  some  white  noise  on 
the  subject.  Please  note  I  have  not 
removed  the  information  present  in 
the  f igures--it Ts  still  all  there. 
It’s  just  that  you  with  your  people¬ 
sized  computational  center  between 
your  eyes  can  no  longer  meaningfully 
process  in  light  patterns  when  the 
random  variations  inherent  in  the 
high  level  of  flux  from  our  second 
source  equal  or  exceed  those  varia¬ 
tions  that  convey  information  to  us . 
You  could  do  the  job  but  only  with 
much  more  processing  or  longer  inte¬ 
gration  times . 


Figure  10.  Images  Formed  by  Scintillations  -  Figure  11.  Effect  of  Adding 

Series  of  Photographs  Showing  the  Quality  Noise  (more  unfocussed  light) 

of  Pictures  Obtained  with  Various  Numbers  to  Imagery  Shown  on  Left 

of  Photons  (or  photoelectrons  when  the 
quantum  efficiency  is  less  than  unity) 


45 


It  is  a  little  like  listening 
to  an  itinerant  violinist  on  a  sub¬ 
way  platform  as  the  express  passes. 

I'm  sure  that  the  violinist's  talent 
did  not  degrade  as  the  train  rushed 
by.  Let's  listen  to  that  analog. 

In  vision,  it's  like  my  imagery  now 
on  the  screen. 

The  problem  can  be  and  is  much  more 
selective  and  should  be  clearly  under¬ 
stood,  but  I  very  much  doubt  if  anyone 
here  ever  took  the  trouble  to  remedy  the 
situation. 

I'm  talking  about  the  non-linear 
effects  that  occur  when  one  acoustic 
frequency  greatly  exceeds  in  amplitude 
a  reasonably  good  signal,  perhaps  a 
couple  of  octaves  removed. 

Like  what?  Like  a  5XC  hiss  in  a 
super-regenerative  receiver  saturating 
your  neural  networks  so  that  you  can't 
read  the  coded  signals.  How  do  you 
clean  that  up?  Filters,  of  course. 

In  acoustics  it's  like  this.  (Audio 
demonstration.)  How  in  Optics?  See 
Figure  12. 


Figure  12.  Moire  Patterns  From  Plowed 
Fields  Due  to  Sampling 


The  fascinating  work  that  is  go¬ 
ing  on  now  in  careful  psychophysical 
experiments  performed  by  a  combina¬ 
tion  of  people  who  understand  both 
physics  and  good  experimental  design 
practice  in  psychology  are  yielding 
good  hard  meaningful  results. 

Unfortunately  in  the  recent  past 
too  much  work  has  been  done  correlat¬ 
ing  human  visual  performance  primarily 
with  things  like  the  number  of  raster 
lines  or  shades  of  gray  in  television 
imagery.  I  hope  my  earlier  demonstra¬ 
tions  convinced  you  that  nothing  is 
important  about  either  one  unless  one 
understands  the  signal  level  inherent 
in  the  raster  and  in  the  image  to  be 
viewed  in  spite  of  that  raster.  The 
shades  of  gray  likewise  are  directly 
related  to  the  dynamic  range  of  sig¬ 
nal  to  noise  ratio. 


In  the  experimental  material 
reviewed  earlier  the  relationship  of 
the  viewer  and  his  ability  to  accom¬ 
plish  certain,  visual  tasks  was  con¬ 
sistently  related  to  a  number  of 
parameters  primarily  the  number  of 
scan  lines  and  the  number  of  shades 
of  gray  in  the  image. 


Let  me  repeat,  though  we  do  not 
disagree  with  the  conclusions  drawn 
from  these  particular  experiments  we 
would  like  to  point  out  that  by 
changing  the  signa 1-to-noise  ratio 
and/or  the  properties  of  the  spot 
sizes  or  "apertures"  used  with  a 
display  raster,  but  not  the  raster 
frequency  or  basic  form,  the  results 
would  be  strongly  affected.  Actually 
one  must  conclude  that  the  angular 
subtense  and  number  of  raster  lines 
do  affect  the  legibility  of  symbols, 
however,  other  basic  factors  of  the 
raster  can  have  equal  or  greater 
effect  and  thus  the  raster  parameters 
must  be  specified. 


For  the  particular  conditions 
of  the  experiment,  the  MTF  of  the 
camera  tube,  the  effective  pre¬ 
filtering  of  the  lens  used,  etc., 
result  in  a  level  of  aliased  signal 
that  causes  definite  and  perceptible 
deterioration  in  image  quality  of 
the  type  that  Schade  shows  in  his 
examples,  shown  in  Figure  13. 
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Figure  13.  Two  Examples  of  Image  Quality  as  a  Function  of  Signal- to- 
Noise  Ratio  for  a  Fixed  Line  Number  Display 
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It  therefore  seems  a  fruitless  task 
to  relate  visual  capability  to  raster 
lines  in  general  when  such  an  effect  is 
highly  related  to  the  design  and  adjust¬ 
ment  of  an  individual  specific  television 
system.  It  is  clear  that  the  raster  vis¬ 
ibility  will  be  an  uncertain,  variable, 
and  real  problem  as  long  as  there  is 
little  or  no  concern  about  its  existence 
or  degree  of  interf erence--and  presently 
there  is  almost  no  such  concern,  since 
by  and  large  the  buyers  of  television 
sets  designed  for  domestic  entertain¬ 
ment  not  only  accept  as  fundamental  the 
presence  of  the  raster  lines  but  often 
choose  a  set  on  the  supposition  that  a 
clear,  sharp,  highly  visible  raster  is 
an  indication  of  a  well-designed  set  in 
good  adjustment. 

Much  thought  needs  to  be  given 
about  the  real  findings  of  the  experi¬ 
ments  discussed  earlier.  As  is  shown 
by  Schnitzler  and  Rosell,  modulation 
and/or  signal-to-noise  ratio  can  be 
used  to  indicate  the  limitations  in 
both  "resolution"  and  "shades  of  gray" 
in  imagery. 

It  is  clear  that  the  recognition 
process  and  related  visual  processes 
are  related  to  "resolution"  and  "shades 
of  gray"  but  it  is  equally  clear  that 
both  of  these  are  dependent  upon  the 
signal-to-noise  ratio  in  the  imagery  as 
a  function  of  spatial  frequency,  the 
latter  determining  the  former. 

It  is  interesting  to  note  that 
shades  of  gray  represent  steps  of  ./~2 
in  luminance  of  the  image  or  resolution 
element.  Obviously  with  a  limited  sig- 
nal-to-noise  ratio  the  number  of  shades 
of  gray  are  automatically  limited;  two 
shades  of  gray  calling  for  a  dynamic 
range  in  S/N  of  ( */  2  ) 2  or  2:1  while 
eight  shades  of  gray  call  for  (*/  2  )8 
or  a  signal-to-noise  dynamic  range  of 
16: 1. 

The  interaction  between  object  size 
and  shades  of  gray  is  highly  predictable. 
Since  the  response  of  a  television  sys¬ 
tem  decreases  in  a  predictable  manner 
as  image  sizes  become  smaller,  signal- 
to-noise  ratios  decrease  as  images  be¬ 
come  smaller  until  finally  the  signal- 
to-noise  ratio  drops  to  a  just  detect¬ 
able  level  when  the  size  of  the  target 


reaches  the  "limiting  resolution"  of 
the  system.  This  point  is  a  function 
of  the  radiance  of  the  object  and  its 
contrast  against  the  background  and 
the  characteristics  of  the  TV  equip¬ 
ment.  At  limiting  resolution  there 
normally  is  no  gray  scale.  There  is 
less  than  one  shade  of  gray  differ¬ 
ence  between  target  and  background, 
for  anything  smaller,  the  signal-to- 
noise  ratio  drops  below  a  detectable 
SNR  or,  as  more  usually  expressed, 
below  a  discernible  contrast  level 
at  the  given  level  of  display 
brightness . 

If  the  properties  of  the  experi¬ 
mental  equipment  were  known,  the  re¬ 
sults  obtained  should  be  predictable 
from  these  parameters  as  well  as  from 
the  derived  parameters  of  resolution 
and  shades  of  gray,  which  are  sub¬ 
jective  manifestations  of  signal  to 
noise . 

As  stated  before,  a  TV  system 
with  the  lens  capped,  produces  the 
same  display  raster,  with  the  same 
bright  and  dark  horizontal  stripes 
as  the  raster  associated  with  a 
bright  and  clear  image.  The  first 
has  a  good  signal-to-noise  ratio  but 
only  at  the  raster  frequency  and  then 
only  in  the  vertical  direction  and 
thus  has  no  informative  content. 

The  experimental  and  theoretical 
work  of  Rosell  and  the  theoretical 
background  derived  from  deVries,  Rose, 
Schade,  Coltman  and  others  do  not 
directly  of  themselves  yield  an  esti¬ 
mate  of  the  performance  of  a  man  in, 
for  example,  distinguishing  between 
the  body  styles  of  various  auto¬ 
mobiles.  Yet  that  same  set  of  theories 
coupled  with  a  knowledge  of  the  equip¬ 
ment  characteristics  and  the  scene 
characteristics  allow  one  to  predict 
the  signal-to-noise  ratio  vs  fre¬ 
quency  on  the  display  of  the  given 
equipment,  the  raster  parameters  and 
therefore  the  number  of  lines  on  a 
given  image,  and  the  shades  of  gray 
in  that  image. 

If  the  latter  two  determine  the 
informative  content  of  imagery,  the 
combination  of  a  knowledge  of  the 
scene  and  the  equipment  should  allow 
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one  to  predict  the  level  of  achievement 
of  the  given  visual  task  which  was  pre¬ 
dictable  from  the  subjective  character¬ 
istics.  Perhaps  more  to  the  point  know¬ 
ing  the  characteristics  of  the  scene  and 
the  observers1  requirements,  one  should 
be  able  to  specify  the  level  of  system 
specification  necessary  to  achieve  a 
given  visual  task. 

Actually  the  computational  process 
for  determining  probability  of  detection 
or  identification,  or  the  inverse  prob¬ 
lems  of  determining  system  parameters 
necessary  to  achieve  the  required  prob¬ 
ability  is  considerably  simpler  than  it 
sounds  in  the  above  discussion.  The 
process  and  methods  of  calculation  are 
presented  in  a  straightforward  manner 
towards  the  end  of  RoseH*  s  Chapter  5 
in  "Perception  of  Displayed  Information" 
by  Biberman  et  al. 

If  there  are  those  who  still  believe 
that  raster  lines  really  do  control  an 
observer Ts  ability  let  me  show  a  series 
of  slides  in  which  everything  is  constant 
except  the  signal-to-noise  ratio. 

Finally  let  me  show  you  a  great  pic¬ 
ture  where  the  number  of  lines  across  a 
subject  meets  all  requirements  but  the 
signal  to  noise  is  zero. 


Figure  14.  Picture  of  a  Tank  at 
1000  Meters,  120  TV  Lines  per 
Tank  Dimension,  Signal- to-Noise 
Equal  to  Zero 
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ABSTRACT 

The  operational  performance  of  an 
image-detecting  and  display  system  is 
measured  by  the  probability  of  acquir¬ 
ing  a  target  and  the  false  alarm  proba¬ 
bility.  Performance  depends  on  a  large 
number  of  factors  concerned  with  the 
scene,  the  atmosphere,  the  electro- 
optical  imaging  system,  the  visual 
system,  and  the  search  process  auto¬ 
matically  followed  by  the  user. 
Motivated  by  knowledge  of  the  search 
process  and  the  structure  and  opera¬ 
tion  of  the  visual  system  the  proba¬ 
bility  of  acquiring  a  target  can  be 
decomposed  into  two  factors,  viz., 
the  foveal  imaging  probability  and  the 
foveal  detection  probability.  Both  are 
subject  to  analysis  by  extension  of  the 
standard  techniques  of  statistical  com¬ 
munication  theory.  Although  this  ana¬ 
lytical  method  is  very  general,  as  an 
example,  it  is  applied  to  a  low  light 
level  television  system. 


INTRODUCTION 

Correctly  predicting  the  operation¬ 
al  performance  of  image-detecting 
systems  and  choosing  the  most  direct 
and  efficient  means  to  improve  on  it, 
depends  on  the  development  of  a  rigor¬ 
ous  analytical  expression  relating 
operational  performance  to  target  and 
background  characteristics  and  to 
system  parameters. 

Image-detecting  systems  generally 
fall  into  three  categories: 


•  The  unaided  visual  system. 

•  The  visual  system  optically 
aided  by  a  binocular  or  a 
telescope. 

•  The  visual  system  electro- 
optipally  aided  by  an  image 
intensifier,  a  television  or 
an  infrared  converter  system. 

The  principal  components  of  the 
electrooptically  aided  visual  system 
are  indicated  in  Fig.  1.  First,  we 
have  the  objective  for  collecting 
radiant  power  and  forming  an  image  on 
the  primary  sensor  and  which  with  its 
primary  lens  or  mirror  generally 
serves  as  the  aperture  stop,  deter¬ 
mining  the  collection  efficiency  by 
the  solid  angle  it  subtends  at  the 
target.  Next,  we  have  the  primary 
sensor  which  functions  as  a  radiant 
power-to-electrical  excitation 
transducer  and  which  serves  as  the 
field  stop  of  the  system,  determining 
the  angular  f ield-of-view  by  the  solid 
angle  which  its  image  in  object  space 
subtends  at  the  objective.  Then  we 
have  the  display  for  converting 
electrical  power  into  luminous  power 
and  which  in  conjunction  with  the 
viewing  distance  determines  the 
utilized  angular  f ield-of-view  of 
the  visual  system  by  the  solid  angle 
which  it  subtends  at  the  entrance 
pupil  of  the  eye.  The  subjective 
magnification  is  the  ratio  of  the 
utilized  linear  angular  f ield-of-view 
of  the  visual  system  in  display  space 
to  the  linear  angular  f ield-of-view  of 
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the  electrooptical  imaging  system  in 
object  space.  Next,  we  have  the  visual 
system  for  collecting  luminous  power 
from  the  display  and  forming  an  image 
on  the  retina  where  electrochemical 
processes  are  initiated  in  the  receptor 
cells  by  the  incident  luminous  power. 

The  collection  efficiency  of  the  visual 
system  is  determined  by  the  solid  angle 
subtended  at  the  display  by  the  entrance 
pupil  of  the  visual  system.  If  an 
ocular  is  employed  as  in  an  image  in- 
tensifier  system,  for  example,  it 
functions  as  a  simple  magnifier,  in¬ 
creasing  the  collection  efficiency  and 
the  utilized  angular  f ield-of-view  of 
the  visual  system  in  display  space. 
Finally,  we  have  within  the  retina 
neurons  which  function  as  threshold 
devices  transmitting  electrical  signals 
on  to  a  higher  level  for  processing  if 
the  electrical  excitations  received 
from  their  photoreceptor  fields  exceed 
their  decision  criteria. 


Figure  1 .  Diagram  of  Composite 
Electro -Optical -Visual 
•System  for 
Image  Detection 


The  purpose  of  an  electrooptical 
imaging  system  is  to  increase  the  ac¬ 
quisition  and  flow  of  information 
from  a  scene  to  the  brain  of  a  user 
over  what  would  occur  if  he  were 
forced  to  rely  on  his  visual  system 
alone.  When  the  purpose  is  stated  in 
these  terms,  an  electrooptical  imaging 
system  is  recognized  as  a  communication 
system.  Hence,  the  measures  of  perfor¬ 
mance,  the  methods  of  analysis  and  the 
methods  of  testing  of  electrooptical 
imaging  systems  are  suggested  by 
consideration  of  electrical  communi¬ 
cation  systems. 


A  powerful  tool  which  has  been 
developed  to  analyze  the  performance  of 
electrical  communication  system  is  pro¬ 
vided  by  statistical  communication  the¬ 
ory.  The  general  objective  of  statisti¬ 
cal  communication  theory  is  to  develop  a 
statistical  description  of  messages  and 
noise  in  order  to  predict  performance 
measured  by  the  detection  and  false 
alarm  probabilities.  A  more  limited 
objective  is  often  pursued  in  which  it 
is  assumed  that  a  known  message  is  being 
transmitted,  but  reception  is  degraded 
by  the  presence  of  noise.  Hence,  in 
this  case,  a  statistical  description  of 
only  the  noise  is  required  in  order  to 
predict  the  detection  and  false  alarm 
probabilities . 

The  definition  of  the  performance 
measures  of  an  image-detecting  system 
depend  on  the  visual  task  described, 
but  in  general  they  are  defined  by  the 
target  acquisition  and  false  alarm 
probabilities.  The  target  acquisition 
task  may  range  from  merely  detecting 
the  presence  of  a  target  in  an  un¬ 
structured  scene  to  precise  identifi¬ 
cation  of  a  particular  target  among  a 
set  of  closely  similar  targets  in  a 
cluttered  scene.  In  any  case  target 
acquisition  is  accomplished  with  an 
electrooptical  imaging  system  by 
visually  examining  the  display  upon 
which  the  angular  size  and  the  con¬ 
trast  of  the  image  of  the  target  are 
larger  than  that  of  the  target  viewed 
directly  by  the  unaided  visual  system. 

Clearly,  to  develop  a  rigorous 
analytical  expression  relating  opera¬ 
tional  performance  to  target  and 
background  characteristics  and  to 
system  parameters  a  large  number  of 
factors  must  be  considered: 

•  The  nature  of  the  radiant  power 
transmitted  from  the  scene  - 
especially  its  spectral  content. 

•  The  spatial  variation  in  re¬ 
flection  and/or  emission  of 
radiant  power  by  targets  and 
backgrounds  to  form  the  optical 
signal  transmitted  from  the 
scene . 
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•  The  attenuation  of  the  optical 
signal  by  absorption  and 
scattering  of  radiant  power  in 
the  intervening  atmosphere. 

•  The  attenuation  of  the  optical 
signal  due  to  the  limited 
aperture  size  and  aberrations 
of  the  objective  of  the  image¬ 
detecting  system. 

•  The  efficiency  and  fidelity  of 
the  conversion  of  the  optical 
signal  into  an  electrical  signal 
and  the  generation  of  electrical 
noise  in  the  electrooptical 
imaging  system. 

•  The  efficiency  and  fidelity  of 
the  conversion  of  the  electrical 
signal  into  a  luminous  signal 

at  the  display  of  the  electro- 
optical  imaging  system. 

•  The  attenuation  of  the  optical 
signal  and  noise  due  to  the 
limited  aperture  size  and 
aberrations  of  the  visual 
system. 

•  The  technique  employed  by  the 
user  to  acquire  the  images  of 
targets  of  interest  reproduced 
on  the  display  which  in  turn 
depends  on  the  complexity  of 
the  scene  and  the  anticipated 
size  and  contrast  of  the 
target. 

In  order  to  simplify  the  problem 
of  developing  a  rigorous  analytical 
expression  relating  operational  per¬ 
formance  to  all  of  the  above  factors, 
it  will  be  convenient  to  decompose 
the  probability  of  acquiring  a  target 
into  two  independent  factors.  Such  a 
decomposition  is  motivated  by  the 
knowledge  that  the  probability  of 
detecting  a  target  of  size  and  con¬ 
trast  near  the  detection  threshold  at 
ordinary  values  of  display  luminance 
is  much  greater  if  the  image  of  the 
target  falls  on  the  foveal  region  of 
the  retina.  In  fact,  foveal  vision 
is  so  much  more  sensitive  than  extra- 
foveal  vision  that  the  muscles 


controlling  the  eye  involuntarily 
rotate  the  eyeball  from  one  fixation 
point  to  another  in  nearly  discrete 
steps  allowing  approximately  three 
fixations  per  second.  Thus,  the 
process  of  acquiring  a  target  near  the 
sensitivity  threshold  by  the  visual 
system  consists  of  a  series  of  fixa¬ 
tions  with  a  relatively  small  angular 
aperture  corresponding  to  the  fovea 
at  a  series  of  points  within  the  larger 
solid  angle  corresponding  to  the  dis¬ 
play  at  the  rate  of  approximately  three 
fixations  per  second.  According  to 
this  process  the  probability  of  ac¬ 
quiring  a  target  depends  on  the  proba¬ 
bility  that  in  a  series  of  fixations 
on  points  of  the  display  the  target 
image  falls  on  the  fovea  and  on  the 
probability  that  whenever  the  target 
image  falls  on  the  fovea,  it  will  be 
detected.  The  process  is  described 

by 

pA(t)  =  psct)pD  (i) 

where  P^(t)  and  Ps(t)  are  the  acqui¬ 
sition  probability  and  the  proba¬ 
bility  that  the  target  image  falls 
on  the  fovea,  respectively,  in  a 
series  of  fixations  of  total  dura¬ 
tion  t  and  Pp  is  the  probability 

that  the  target  image  on  the  fovea 
is  detected.  As  the  size  and  con¬ 
trast  of  the  target  increase  the 
probability  of  extra-foveal  detection 
becomes  significant  so  that  only  one 
fixation  is  required  to  acquire  the 
target  and  Pg(t)  as  well  as  ap¬ 
proaches  unity. 


It  may  be  noted  that  Eq.  (1) 
differs  with  BaileyTs  Eq.  (1) 
describing  his  target  acquisition 
model  (Bailey,  1970)  by  omitting  a 
conditional  probability  factor  at¬ 
tributed  to  recognition  once  de¬ 
tection  has  occurred  and  an  overall 
degradation  factor  attributed  to 
noise.  The  first  factor  is  omitted 
because  recognition  results  from 
detection  of  details  of  target  shape 
or  structure  which  serve  to  establish 
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recognition  of  the  target.  Like¬ 
wise,  identification  results  from 
detection  of  smaller  details  of 
target  shape  or  structure  which 
serve  to  establish  identification 
of  the  target.  Obviously,  the  set 
of  target  details  required  for 
recognition  or  identification  will 
depend  on  the  target  set  to  which 
a  particular  target  belongs.  If  the 
target  details  required  for  recog¬ 
nition  or  identification  are  of 
sufficient  size  and  contrast  to 
permit  a  reasonable  detection 
probability,  detection  of  the  target 
as  a  whole  does  not  occur.  Rather, 
at  the  primary  decision  level  in 
the  retina  detection  of  the  set  of 
target  details  occurs  and  a  corre¬ 
sponding  set  of  signals  is  trans¬ 
mitted  to  the  higher  decision  levels 
in  the  brain  where  the  signal 
processing  and  comparison  with 
memory  required  for  recognition  or 
identification  occur.  Mere  detection 
of  the  target  per  se  is  irrelevant  in 
this  process.  Thus,  recognition  or 
identification  are  not  conditional  on 
detection  of  the  target  and  if  the 
visual  task  is  recognition  or  identi¬ 
fication,  it  should  be  realized  that  in 
Eq.  (1)  Pp  symbolizes  the  detection 

of  the  requisite  details.  However,  to 
avoid  confusion  either  the  symbol  P^, 

PR  or  P^  can  be  used  in  Eq.  (1)  to 

designate  the  probability  either  of 
detecting  the  entire  target,  of  de¬ 
tecting  details  characteristic  of 
recognition,  or  of  detecting  details 
characteristic  of  identification, 
respectively . 

Omission  of  the  overall  degradation 
factor  attributed  to  noise  by  Bailey  is 
made  because  the  effect  of  noise  is 
properly  accounted  for  in  the  detection 
probability.  Indeed,  the  detection 
probability  is  determined  by  the 
signal-to-noise  ratio  in  conjunction 
with  the  decision  criterion  at  the 
primary  decision  level  in  the  neurons 
of  the  retina.  In  the  absence  of 
noise,  no  matter  how  small  in  size  and 
contrast  a  target  were  found,  the 
detection  probability  would  be  unity. 

Now  that  the  acquisition  proba¬ 
bility  has  been  logically  decomposed 
into  two  factors,  each  factor  in  turn 
will  be  analyzed  in  detail. 


Foveal  Imaging  Probability 

The  derivation  of  the  cumulative 
probability  that  in  a  series  of  fixa¬ 
tions  of  total  duration  t  ,  the  image 
of  a  target  falls  on  the  foveal  region 
of  the  retina  is  essentially  contained 
in  Davies1  derivation  of  the  equations 
of  glimpse  theory  (Davies,  1968),  How¬ 
ever,  to  clarify  concepts  relevant  to 
the  above  description  of  the  target 
acquisition  process,  a  brief  derivation 
of  the  foveal  imaging  probability 
Pg(t)  is  presented  here. 

In  order  to  derive  an  expression 
for  the  foveal  imaging  probability  in 
accordance  with  the  above  description 
of  the  target  acquisition  process  it 
is  convenient  to  let  p^  be  the 

probability  that  the  target  image 
falls  on  the  fovea  in  the  kth  fixa¬ 
tion.  Then  (1  -  p^)  is  the  proba¬ 
bility  of  not  imaging  the  target  on 
the  fovea  in  the  kth  fixation. 

If  the  size  and  contrast  of  the 
target  are  relatively  small  so  that 
the  probability  of  detection  when 
the  image  falls  outside  the  fovea  is 
nil,  then  it  is  reasonable  to  assume 
that  the  series  of  fixations  points 
will  be  randomly  distributed  over  the 
display.  Moreover,  experimental 
results  tend  to  be  in  conformity  with 
random  fixation  points  (Davies,  1968). 
In  this  case  the  cumulative  proba¬ 
bility  of  not  imaging  the  target  on 
the  fovea  in  n  fixations  is  given 
n 

by  the  product  IT  (1  -  p,  ).  Thus  • 
k=l  * 

the  cumulative  foveal  imaging  proba¬ 
bility  in  n  fixations  is  given  by 
n 

Pq(n)  =  1  -  TT  (1  -  p  )  (2) 

S  k=l  K 

It  is  reasonable  to  assume  that 
the  foveal  imaging  probability  per 
fixation  is  a  constant,  i.e.,  that 
p^  =  p  independently  of  the  number 

of  the  fixation  in  the  series.  In  ad¬ 
dition  if  the  duration  of  a  fixation  is 
designated  by  T  and  the  elapsed 
search  time  by  t  ,  then  n  =  t/T  and 
the  cumulative  foveal  imaging  proba¬ 
bility  is  given  by 
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Ps(t)  =  1  -  (1  -  p)t/T  (3) 

The  quantity  (1  -  p)t^T  can 
always  be  expressed  in  exponential 
form  by  setting  it  equal  to  exp(u) 
and  solving  for  u  which  is  found 
to  be  given  by  u  =  (t/T)ln(l  -  p). 

Thus,  Eq.  (3)  can  be  written  as 

Ps(t)  =  1  -  exp( -t/ t )  (4) 

where  t  =  -T/ln(l  -  p). 

If  the  solid  angle  subtended  by 
the  image  of  the  fovea  on  the  display 
and  the  solid  angle  subtended  by  the 
display  at  the  entrance  pupil  of  the 
eye  are  designated  by  uu  and  ft  , 
respectively,  then  p  =  uu/ft  .  In  addi¬ 
tion  if  uu  is  small  compared  to  ft  , 
then  ln(  1  -  p )  ~  -  p  and  t  c*  ftT/uu  . 
Finally,  in  conformity  with  this  ap¬ 
proximation  the  cumulative  foveal 
image  probability  is  given  by 

Ps(t)  =  1  -  exp(  -uut/ftT )  (5) 

As  an  example,  suppose  the  display 
subtends  approximately  15  degrees  or 
6xl0“2  steradians.  Now  it  is  known  that 
the  most  sensitive  part  of  the  fovea 
the  fovea  centralis  subtends  approxi¬ 
mately  0.9°  and  the  entire  fovea 
approximately  7°.  Thus,  the  portion 
of  the  fovea  employed  in  the  target 
acquisition  process  will  range  from 
0.9°  to  7°  depending  on  the  size  and 
contrast  of  the  target.  For  this  ex¬ 
ample  let  us  assume  that  the  fovea 
subtends  3°  or  approximately  2x10“^ 
steradians  and  the  allowed  search  time 
is  10  seconds.  Since,  in  addition,  the 
duration  of  a  fixation  is  well-known 
to  be  approximately  1/3  seconds, 

Pg  =  0.632. 

Rather  than  choosing  the  search 
time  and  solving  for  the  foveal  imaging 
probability  it  is  interesting  to  choose 
the  foveal  imaging  probability  and 
solve  for  the  search  time.  The  ex¬ 
pression  for  the  search  time  is  given 
by 

t(p  )  =  ( -OT/uu) ln(l  -  Ps)  (6) 

In  the  above  example  if  the  required 
foveal  imaging  probability  is  chosen  to 
be  0.90  or  0.99,  the  required  search 
time  is  found  to  be  23  sec.  or  46  sec.. 


respectively.  However,  before  drawing 
any  conclusions  concerning  optimum 
angular  size  of  the  display  from  this 
example  it  is  necessary  to  analyze  the 
foveal  detection  probability. 

Foveal  Detection  Probability 

In  order  to  derive  the  foveal 
detection  probability  it  is  necessary 
to  define  the  meaning  of  a  statistical 
unit  in  an  image-detecting  system.  A 
statistical  unit  is  an  aggregate  of 
quanta  or  electrons  which  act  together 
as  a  single  unit  or  packet.  In  an 
electrooptical  imaging  system,  at  the 
primary  sensor  the  statistical  units 
are  the  primary  electrons  excited  by 
the  incident  radiant  power  collected 
from  the  scene.  Each  primary  electron 
is  excited  at  a  point  in  space  and  time 
(of  course,  only  to  the  precision 
permitted  by  the  Heisenberg  uncertainty 
principle).  At  the  display  the  statis¬ 
tical  units  are  the  aggregates  of 
quanta  whose  mean  number  is  equal  to 
the  mean  value  of  the  particle  gain 
or  intensification  gain  between  the 
primary  sensor  and  the  display.  In 
transmission  from  the  primary  sensor 
to  the  display  and  within  the  display 
the  statistical  units  undergo  both 
spatial  and  temporal  dispersion  so 
that  they  now  occupy  a  small  volume  in 
space.  In  transmission  from  the  dis¬ 
play  to  the  retina  and  within  the 
retina  the  statistical  units  undergo 
further  dispersion  in  space  and  time. 

At  the  retina  the  statistical  units 
are  the  aggregates  of  primary  molecular 
excitations  whose  mean  value  is  the 
product  of  the  mean  value  of  a  sta¬ 
tistical  unit  emitted  at  the  display, 
the  collection  efficiency  of  the 
visual  system  and  the  responsive 
quantum  efficiency  of  the  visual 
system. 

If  the  magnitude  of  a  statistical 
unit  at  the  retina  is  greater  than 
unity,  an  image-detecting  system  is 
said  to  be  background  limited  due  to 
shot  noise  or  generation-recombination 
noise  introduced  at  the  primary  sensor 
as  a  function  of  fluctuations  in  the 
background  radiance  of  the  scene. 

This  condition  prevails  in  the  case 
of  the  unaided  visual  system  at  all 
luminance  values  except  near  zero 
where  internal  noise  seems  to  be 
dominant  and  in  the  cases  of  the  low 
light  level  television,  the  image 
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intensifier  and  the  infrared  con¬ 
verter  systems  at  target  and  back¬ 
ground  conditions  where  they  are 
mostly  used.  Under  some  conditions 
image-detecting  systems  such  as  tele¬ 
vision  systems  incorporating  video 
amplifiers  may  not  be  background  limited 
even  though  the  magnitude  of  a  statisti¬ 
cal  unit  at  the  retina  is  greater  than 
unity.  This  situation  exists  where  no 
or  insufficient  intensifier  gain  is 
provided  between  the  primary  sensor 
and  the  video  preamplifier.  Such 
systems  are  said  to  be  video  pre¬ 
amplifier  noise  limited. 


The  magnitude  of  a  statistical  unit 
cannot  be  less  than  unity.  Hence,  if 
the  product  of  the  magnitude  of  a  sta¬ 
tistical  unit  at  the  display,  the 
collection  efficiency  of  the  visual 
system  and  the  responsive  quantum 
efficiency  of  the  visual  system  is 
less  than  unity,  the  product  repre¬ 
sents  the  fraction  of  the  primary 
electrons  which  are  detected  as  sta¬ 
tistical  units  at  the  retina.  In  this 
case  the  image-detecting  system  is  said 
to  be  display  limited  due  to  noise 
generated  in  the  retina  as  a  function 
of  fluctuations  in  the  background 
luminance  of  the  display. 


While  the  structure  of  an  image  at 
the  retina  is  composed  of  statistical 
units,  the  individual  statistical  units 
are  rarely  perceived  except  when  a  very 
low  radiant  power  scene  is  viewed 
through  an  image  intensifier  system, 
a  low  light  level  television  system  or 
when  the  totally  dark  adapted  eye  is 
suddenly  exposed  to  a  room  light  fil¬ 
tered  through  closed  eyelids.  Under 
most  conditions,  the  rate  of  arrival 
of  statistical  units  at  the  retina  per 
unit  area  per  second  and  their  overlap 
in  space  and  time  due  to  spatial  and 
temporal  dispersion  in  both  the  electro- 
optical  and  visual  systems  is  too  great 
for  individual  statistical  units  to  be 
perceptible.  In  addition  electrical 
noise  added  particularly  in  video 
amplifier  systems  will  partially  or 
totally  obscure  the  perception  of 
individual  statistical  units.  Al¬ 
though  individual  statistical  units 
are  rarely  perceptible,  they  are  the 
basic  units  which  compose  an  image. 


To  derive  the  foveal  detection 
probability  it  is  assumed  that  the 
target  or  a  detail  of  the  target  is  a 
patch  of  uniform  differential  radiance 
relative  to  its  local  surroundings 
superimposed  on  a  background  radiance 
nominally  equal  to  the  average  radiance 
of  the  scene.  Furthermore,  it  is  as¬ 
sumed  that  the  boundary  of  the  patch 
can  be  approximated  by  a  simple  geo¬ 
metrical  figure  such  as  a  circle  or  a 
rectangle.  If  the  objective  optics 
were  perfect,  the  temporal  mean  value 
of  the  irradiance  in  the  perfect  image 
formed  on  the  primary  sensor  would  be 
given  by 

n-j-CXjy)  =  nBI  +  An^Xjy)  (7) 

where  nBI  is  the  temporal  mean  value 

of  the  background  irradiance  and 
An^(x,y)  is  the  temporal  mean  value 

of  the  differential  irradiance  in  the 
perfect  image  relative  to  the  back¬ 
ground.  Since  it  is  assumed  that  the 
radiance  of  the  patch  is  uniform, 
An.j.(x,y)  is  a  constant  within  the 

boundary  of  the  perfect  image  and 
zero  elsewhere. 

The  temporal  mean  value  of  the 
number  of  electrons  N  excited  at  the 
primary  sensor  by  the1incident  radiant 
power  within  the  area  a  of  the  perfect 
image  and  during  an  exposure  or  inte¬ 
gration  time  t  is  obtained  by  multi¬ 
plying  Eq.  (7)sby  rlsaITs  where  T)  is 

the  responsive  quantum  efficiency  of 
the  primary  sensor.  However,  in  a 
series  of  exposures  occurring  at  t^, 

t? ,  ...,  t^,  ...  the  number  of  ex- 

excited  electrons  N_(t,  )  is  a  random 
IN  k 

variable  with  a  Poisson  probability 
density.  Thus,  even  though  the  target 
were  absent,  the  number  of  electrons 
excited  within  the  area  of  the  perfect 
image  would  deviate  from  the  temporal 
mean  value  and  would  be  given  by 

NI(tk)  =  NfiI  +  ANx(tk)  (8) 

where  AN^( t^)  is  the  value  of  the 
random  deviation  in  the  kth  exposure. 

If  the  temporal  mean  value  is 
not  too  small,  the  Poisson  probability 
density  of  a  random  variable  is  in¬ 
distinguishable  from  the  more  general 
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Gaussian  probability  density  -  the 
variance  in  this  special  case  being 
equal  to  the  mean  value.  Thus,  in  the 
absence  of  a  target  the  probability 
density  of  the  random  deviation  in  the 
number  of  electrons  excited  per  area 
a^  and  exposure  Tg  is  given  by 

p(ANI)d(ANI)  =  ( 2TT)-1/20g1 

x  exp[-(ANI)2/20g]d(ANI)  (9) 

where  a  is  the  standard  deviation 

—  1/2 
equal  to  (Ngj.)  7  . 

From  Eq.  (9)  the  normalized 
Gaussian  probability  density  is  ob¬ 
tained  by  letting  x  =  AiL/a^  to  obtain 

1  D 

p(x)dx  =  (2TT)"1/2exp(-x2/2)dx  (10) 

If  the  target  is  present  the  devia¬ 
tion  in  the  number  of  excited  electrons 
in  an  exposure  is  approximately  equal 
to  the  algebraic  sum  of  the  random 
deviation  due  to  the  background  and  the 
differential  number  due  to  the  temporal 
mean  value  of  the  differential  irradi- 
ance  of  the  perfect  image  relative  to 
the  background.  This  statement  is  based 
on  the  assumption  that  the  differential 
irradiance  in  the  vicinity  of  the 
threshold  for  detection  is  small  com¬ 
pared  to  the  background.  Later  it  will 
be  shown  that  this  is  a  valid  approxi¬ 
mation  for  real  images  at  the  decision 
level  in  the  output  of  the  visual 
system.  By  the  above  statement  if  the 
target  is  present,  the  random  deviation 
in  the  number  of  excited  electrons  in 
the  kth  exposure  is  given  by  ANj(t  )  - 

AN^  and  its  probability  density  is 

given  by 

p(ANI-ANI)d(ANI)  =  ^ff)'1/2^1 

x  exp[-(ANI-ANI)2/2a2]d(ANI)  (11) 

This  equation  may  be  expressed  as 
a  normal  Gaussian  probability  density 
by  substituting  k^  =  AN^/a^  and  again 

letting  x  =  AN./a-,  to  obtain 

I  D 

p(x-)C[)dx  =  (2TT)“1/2  (12) 

x  exp[ -(  x-kj )2/2 ]dx 


where  ANj  and  k^  are  recognized  as  the 

signal  and  the  signal-to-noise  ratio, 
respectively,  of  the  electron  excita¬ 
tions  at  the  primary  sensor  due  to 
the  target. 

If  an  image-detecting  system 
were  perfect  introducing  no  additive 
electrical  noise  and  no  temporal  or 
spatial  dispersion  and  forming  a 
perfect  image  composed  of  statistical 
units  which  were  points  in  space  and 
time  at  the  retina,  then  the  output 
signal-to-noise  ratio  kQ  at  the  deci¬ 
sion  level  would  equal  the  signal-to- 
noise  k-j.  at  the  primary  sensor.  In 

this  case  the  foveal  detection  and 
false  alarm  probabilities  could  be 
derived  from  the  normal  probability 
density  equations  (10)  and  (12). 

Although  in  practice  spatial  and 
temporal  dispersion  occurs  in  each 
component  of  the  image-detecting 
system  including  the  visual  system 
so  that  at  the  retina  the  real  image 
area  is  greater  than  the  perfect 
image  area  and  the  statistical  units 
have  a  finite  area  and  duration,  the 
probability  density  of  the  random 
deviations  at  the  output  of  the  visual 
system  remains  Gaussian.  Hence,  the 
foveal  detection  and  false  alarm 
probabilities  can  be  derived  from 
Eqs.  (10)  and  (12)  if  k^  is  replaced 

by  kQ  and  an  expression  is  derived 
for  kQ  in  terms  of  target  and  back¬ 
ground  characteristics  and  system 
parameters . 

At  the  decision  level  in  the 
output  of  the  visual  system  a  neuron 
fires  sending  a  message  over  one  of 
the  fibers  of  the  optic  nerve  if  and 
only  if  the  signal  and/or  noise 
levels  exceed  a  threshold  level  known 
as  the  decision  criterion.  If 
ANo ( tk )  is  the  differential  number  of 

molecular  excitations  produced  in  the 
retina  by  the  algebraic  sum  of  the 
random  deviation  in  the  number  due  to 
the  background  in  the  kth  exposure 
and  the  temporal  mean  value  of  the 
number  due  to  the  difference  in 
radiance  between  the  target  and  back¬ 
ground,  then  a  neuron  fires  if  and 
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only  if  ANQ( t^)  ^  ANT  where  ANT  is  the 

decision  criterion.  If  we  divide  by  the 

standard  deviation  a  this  conditional 

o 

equation  becomes  x  ^  k^  where  k^  = 

AN m/o  is  the  decision  criterion  ex- 
T  o 

pressed  as  a  threshold  signal-to-noise 
ratio . 


The  relations  between  the  normal¬ 
ized  Gaussian  probability  densities  and 
the  decision  criterion  are  illustrated 

in  Fig.  2  where  km  and  k  have  been 
To 

chosen  equal  to  2  and  3,  respectively. 
The  false  alarm  probability  is  equal 
to  the  area  in  Fig.  2  under  the  p(x) 
curve  bounded  on  the  left  by  x  =  k^. 
Thus,  we  have 


1 


pp  =  /  p(x)dx 


(13) 


This  integral  is  recognized  as  the 
standard  tabulated  error  function. 
Hence, 

1  -  PF  =  erf(kT)  (14) 


The  detection  probability  is  given 
by  the  area  in  Fig.  2  under  the  p(x-kQ) 


curve  bounded  on  the  left  by  x  = 


k^ 


i.e.  , 


pD  =  /  p(x-kQ)dx  (15) 

x 

Since  p(x-kQ)  is  symmetrical  about  the 

x  =  k^  axis,  this  area  is  the  same  as 

the  area  under  the  p(x-k  )  curve  bounded 
r  o 

on  the  right  by  kQ  +  (kQ  -  kT).  Hence, 


2ko_kT 

pD  =/  p(x-kQ)dx  (16) 


To  express  as  an  error  function  we 
let  u  =  x  -  kQ.  Then  we  have 


k  -k_ 
_o  T 


PD  =/  p(  u)du 


(17) 


Figure  2.  Noise  and  Signal  plus  Noise 
Normal  Gaussian  Probability  Density 
Functions;  is  Threshold  Signal-to- 
Noise  Ratio;  kQ  is  Output  Signal-to- 
Noise  Ratio 


or 

PD  =  erf(kc-kT)  (18) 

It  can  be  shown  (Schnitzler, 

1973)  by  comparison  of  Eq.  (18)  with 
experimental  statistical  detection 
data  reported  by  Blackwell  (1963)  that 
the  value  of  the  threshold  signal-to- 
noise  ratio  falls  in  the  interval  (2, 

3)  for  a  range  of  background  luminance 
from  10 "6  to  102  ft.-L  and  a  range  of 
target  angular  sizes  from  essentially 
a  point  to  102  mr.  The  facts  that  the 
value  of  kT  deduced  from  experimental 

data  is  reasonable  and  essentially  in¬ 
variant  to  changes  in  background  lumi¬ 
nance  and  target  size  strongly  sub¬ 
stantiates  the  above  description  of 
the  decision  process. 


60 


To  complete  the  derivation  of  the 
foveal  detection  probability  it  remains 
to  derive  an  expression  for  the  output 
signal-to-noise  ratio  kQ  in  terms  of 

target  and  background  characteristics 
and  system  parameters.  The  effects  of 
spatial  and  temporal  dispersion  in  the 
components  of  an  image-detecting 
system  are  to  increase  the  area  of  the 
target  image  and  to  increase  the  area 
and  duration  of  a  statistical  unit. 


Although  the  area  of  the  target 
image  is  increased  by  spatial  disper¬ 
sion,  the  differential  number  of  sta¬ 
tistical  units  within  the  image  area 
remains  unchanged.  Since  the  differ¬ 
ential  number  of  statistical  units  is 
unaffected  by  spatial  dispersion,  the 
output  signal  is  equal  to  the  product 
of  the  magnitude  of  a  statistical  unit 
at  the  retina  and  the  number  of  sta¬ 
tistical  units  due  to  the  differential 
irradiance  in  the  perfect  image  of  the 
target  at  the  primary  sensor.  The 
magnitude  of  a  statistical  unit  at  the 
retina  is  (GpC7^)  where  is  the 

particle  gain  between  the  primary 
sensor  and  the  display,  Q  is  the 
collection  efficiency  and  r\ ^  is  the 

responsive  quantum  efficiency  of  the 
visual  system.  The  differential  number 
of  statistical  units  in  the  perfect 
image  at  the  primary  sensor  is  equal  to 
(rigAn^a^Tg) ,  where  r\ ^  is  the  responsive 

quantum  of  the  primary  sensor,  An^  is 

the  mean  differential  irradiance  as  de¬ 
fined  in  Eq.  (7),  a^  is  the  area  of  the 

ideal  image  of  the  target  on  the  primary 
sensor  and  is  the  effective  inte¬ 
gration  time  of  the  image-detecting 
system.  Thus,  the  output  signal  S  is 
given  by  ° 

So  =  ^GD^T|E^r|SClnBiaiTS 

where  CTn__  has  been  substituted  for 
I  BL 

An^.  and  C^.  is  the  contrast  of  the  ideal 
image  or  of  the  target  itself. 

While  the  differential  number  of 
statistical  units  contained  in  the  image 
of  a  target  is  unaffected  by  spatial 
dispersion,  the  number  of  background 


statistical  units  increases  in  pro¬ 
portion  to  the  area  of  the  image.  At 
the  same  time,  the  increase  in  area 
and  duration  of  statistical  units  by 
spatial  and  temporal  dispersion  results 
in  an  overlapping  of  statistical  units. 
The  increase  in  number  of  background 
statistical  units  tends  to  increase 
the  standard  deviation  while  the  over¬ 
lapping  of  statistical  units  tends  to 
decrease  the  standard  deviation  in  the 
number  of  molecular  excitations  pro¬ 
duced  within  the  area  of  the  target 
image  per  integration  time. 

In  order  to  properly  account  for 
the  effects  of  spatial  and  temporal 
dispersion  on  noise  in  an  image-detect¬ 
ing  system,  it  is  necessary  to  represent 
the  random  excitation  of  electrons  at 
the  primary  sensor  and  the  response  at 
the  output  of  the  visual  system  by  power 
density  spectra.  It  is  shown  in  text 
books  concerned  with  statistical  com¬ 
munication  theory  (Lee,  1961)  that  the 
noise  power  density  spectrum  due  to 

a  Poisson  distribution  of  unit  impulses 
which  represent  the  random  excitation 
of  electrons  at  the  primary  sensor  is 
given  by 

h  =  Vbi  (20) 

Likewise,  if  the  image-detecting 
system  is  linear  and  both  spatially 
and  temporally  invariant,  the  output 
noise  power  density  spectrum  is  equal 
to  the  product  of  the  square  of  the 
magnitude  of  the  transfer  function 
and  the  noise  power  density  spectrum 
at  the  input.  Thus,  the  output  noise 
power  density  spectrum  is  given  by 

yv)  =  (GDCnE)2T2T^R2i.(v)  (21) 

where  the  frequency  v  consists  of 
three  components,  two  for  the  two 
orthogonal  axis  in  the  plane  and  one 
for  the  temporal  frequency,  (G^C7^) 

is  the  magnitude  of  a  statistical 
unit  defined  above,  TD  is  the  modu¬ 
lation  transfer  function  of  the  system 
from  the  primary  sensor  to  the  dis¬ 
play,  Tg  is  the  modulation  transfer 

function  of  the  visual  system  and  R 

u 

is  a  low  pass  filter  function  which 
accounts  for  spatial  integration  over 
the  target  image  by  the  visual  system. 
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The  variance  of  the  response  rate 
averaged  over  the  image  area  is  ob¬ 
tained  by  integrating  the  output  power 
density  spectrum  over  all  frequencies. 
Thus,  the  variance  is  given  by 

*0  =  ^SnBI  ^TDTE^Edv 

—  00 


Since  the  system  is  invariant  in 
time  the  temporal  frequency  component 
can  be  separated  from  Eq.  (22)  and 
performed  separately.  The  result  is 

/°T^(f )T2(f )df  =  1/ Tg  where  the  inte- 
—  00 

gral  is  the  noise  equivalent  temporal 
frequency  bandwidth  and  Tg  is  the 

response  time  of  the  system. 

If  the  area  aQ  of  the  target 

image  at  the  retina  is  large  compared 
to  the  area  of  a  statistical  unit, 
the  total  variance  of  the  random 
excitations  is  equal  to  the  product 
of  the  variance  of  the  average  rate  of 
response,  the  square  of  the  image  area 
and  the  square  of  the  response  time. 
This  result  is  given  by 

Qo  =  riSnBiao / TDTEREdv 

-00 

If  Eq.Ts  (19)  and  (23)  are  com¬ 
bined,  the  output  signal-to-noise  ratio 
is  found  to  be 

y  00  j 

ko  “  Cl^ai^ao^^SnBIT^/  TDTEREdv'^^24^ 

Since  the  output  signal-to-noise 
ratio  is  not  accessible  for  direct 
measurement  in  the  visual  system,  it 
is  conventional  to  report  experimental 
data  in  the  form  of  input  contrast  re¬ 
quired  for  50  percent  detection  proba¬ 
bility  as  a  function  of  background 
radiance  and  target  angular  size. 

Since  the  input  contrast  required  for 
50  percent  detection  probability  cor¬ 
responds  to  equality  between  the  output 
and  threshold  signal-to-noise  ratios, 
the  expression  noise  required  input 
contrast  is  adopted  to  designate  it. 

The  noise  required  input  contrast  CTM 


is  obtained  by  setting  kQ  =  kT  in 
Eq.  (24)  and  solving  for  C^.  The 
result  is  given  by 


CIN  mS( 


“„V{J 


(25) 


where 

mS  =  4kT/DS(TTT1SKBTS)  ?  (26) 

In  going  from  Eq.  (24)  to  (25)  the 
shape  of  the  target  was  assumed  to  be  a 
circular  disc  in  anticipation  of  compar¬ 
ison  with  experimental  data  (Blackwell, 
1946),  the  circular  areas  aQ  and  a^ 

have  been  expressed  in  terms  of  their 
angular  subtenses  a  and  a,  respectively, 
at  the  entrance  pup^l  of  the  system, 
the  background  irradiance  n^  at  the 

primary  sensor  has  been  expressed_in 
terms  of  the  background  exitance  nfi 

at  the  scene  and  in  Eq.  (26)  Dg  is  the 

diameter  of  the  entrance  pupil  of  the 
system. 


Since  the  results  of  a  reliable 
determination  of  the  modulation  trans¬ 
fer  function  of  the  visual  system  are 
not  available,  it  is  useful  to  express 
the  noise  required  input  contrast  of 
an  image -detecting  system  in  terms  of 
the  noise  required  input  contrast  of 
the  unaided  visual  system  viewing  a 
projector  screen  at  the  same  luminance 
as  the  display  of  the  electrooptical 
system.  The  result  is  given  by 


CIH<E>  -  Cik<ED>M) 

x  |Va2  +  a^)/(M2a2  +  (27) 

*  [/ITDT^Ed'’/i"TRREd'’]% 

where  E  is  the  mean  luminance  of  the 
scene,  M  is  the  magnifying  power  of 
the  electrooptical  system,  mp  is  ob¬ 
tained  from  Eq.  (26)  by  replacing 
DS,T1S,TS  ky  corresponding  visual 

parameters  and  n^  by  the  mean  exitance 
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of  the  display,  ag  is  the  angle  sub¬ 
tended  at  the  entrance  pupil  of  the 
image-detecting  system  by  the  image  of 
a  point  source  on  the  retina  scaled  to 
its  size  in  object  space,  is  the 

analogous  parameter  for  the  unaided 
visual  system,  TD  is  the  retinal  part 

of  the  modulation  transf er_function  of 
the  visual  system  and  C^CE^M)  is  the 

noise  required  input  modulation  of  the 
unaided  visual  system  viewing  a  pro¬ 
jector  screen  at  the  display  luminance 
Ep.  This  latter  factor  depends  on  M 

because  the  target  angular  size  is 
with  reference  to  the  input  to  the 
image-detecting  system. 

To  present  a  specif ic ’ example  a 
background  limited  low  light  level 
television  system  will  be  considered. 

To  simplify  the  example  it  is  assumed 
that  modulation  transfer  function  of 
the  overall  image-detecting  system  is 
limited  by  the  retinal  part  of  the 
visual  system  so  that  the  integral 
terms  in  Eq.  (27)  cancel  and  the 
bracketed  coefficient  is  unity.  Thus, 
in  this  case  Eq.  (27)  reduces  to 

Cin(E)  =  M(ms/mE)CIN(ED,M)  (28) 

If  reasonable  values  of  the  low 
light  level  television  parameters  are 
assumed  such  as  the  objective  focal 
length  equal  to  12  in. ,  the  width  of  the 
primary  sensor  equal  to  32  mm,  the  view¬ 
ing  distance  24  in. ,  and  the  width  of  the 
display  equal  to  11.3  in.,  the  magnify¬ 
ing  power  equals  4.5. 

The  separate  effects  of  magnifying 
power,  brightness  gain,  collection  ef¬ 
ficiency  and  responsive  quantum  effi¬ 
ciency  on  the  performance  of  the  low 
light  level  television  system  as  an 
adjunct  of  the  visual  system  are  shown 
in  Fig.  3.  It  is  assumed  that  the 
targets  are  circular  discs  in  order  to 
make  use  of  the  Blackwell  data  for  de¬ 
tection  of  circular  discs  projected  on 
a  screen  at  lCn  ft-L  corresponding  to 
the  display  luminance.  In  principle  the 
procedure  could  be  applied  to  any  target 
if  standard  observer  data  for  detection 
on  a  projector  screen  at  the  display 
luminance  were  available.  The  Blackwell 


experimental  disc  detection  data  is 
shown  in  Fig.  3  by  the  dashed  curve 
labeled  "Display  Limit."  This  curve 
depicts  the  logarithm  of  the  function 
CjN(Ed,M)  in  Eq.  (28).  Note  that  the 

values  of  1/a  are  for  object  space  of 
the  low  light  level  television  system. 


Figure  3.  Logarithm  of  Noise  Required 
Input  Contrast  versus  Logarithm  of 
Reciprocal  Target  Angular  Size  where 
the  Angle  is  in  Milliradians .  The 
Curves  are  Discussed  in  the  Text 


If  the  unaided  visual  system  were 
to  view  the  scene  which  is  at  a  lumi¬ 
nance  of  10~5  ft-L,  then  the  Blackwell 
data  results  in  the  log  vs  log 

(1/a)  curve  labeled  "Unaided  Eye"  in 
Fig.  3.  Now  the  effect  of  subjective 
magnification  alone  is  shown  by  the 
curve  labeled  "Telescope"  which  is  ob¬ 
tained  from  the  unaided  eye  curve  by 
increasing  the  values  of  (1/a)  by  the 
magnifying  power  4.5. 
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If  the  diameter  of  the  aperture  of 
the  low  light  television  system  were 
equal  to  that  of  the  telescope  but  the 
brightness  gain  of  10'  were  utilized, 
then  the  curve  labeled  LLTV,  M 

is  obtained  from  the  "Display  Limit" 
curve  by  employing  Eq.  (28).  Note  that 
the  performance  of  the  unaided  eye 
would  be  better  for  values  of  1/a  less 
than  approximately  0.4  mr"  . 


If  the  full  aperture  of  the  system 
were  used  the  curve  labeled  (hg  =  Tj^)  is 

obtained  from  the  "Display  Limit"  curve 
by  employing  Eq.  (28).  Of  course,  en¬ 
larging  the  aperture  improves  the 
performance  but  the  unaided  eye  still 
performs  better  at  values  of  1/a  less 
than  approximately  0.06  mr“l. 


Finally,  if  in  addition  to  sub¬ 
jective  magnification  and  larger  aper¬ 
ture  the  primary  sensor  of  the  low  light 
level  television  system  has  a  respon¬ 
sive  quantum  efficiency  of  5  mA/W  the 
curve  labeled  (hg  “  5mA/W)  is  obtained. 


While  the  performance  of  the  unaided 
eye  is  better  at  values  of  1/a  less 
than  0.03  mr-1  such  large  targets  are 
of  little  interest  in  practice. 


The  effect  of  varying  the  size  of 
the  display  and  hence  the  subjective 
magnification  of  the  assumed  low  light 
level  television  system  is  shown  in 
Fig.  4  for  a  scene  luminance  of  10“ - 
ft-L.  Note  that  as  the  subjective 
magnification  decreases  the  performance 
at  large  angles  improves  but  the  per¬ 
formance  at  small  angles  degrades^.  At 
M  =  0.59  the  coefficient  of  C^N(E^,M) 

is  unity.  Further  decreases  in  M  result 
in  degradation  at  all  angles. 


DISCUSSION  AND  CONCLUSIONS 


reveals  that  at  a  display  luminance 
of  100  ft-L  the  maximum  angular 
diameter  a.  of  the  effective  area  of 
integration  on  the  retina  is  3.7  mr. 
Indeed,  the  reciprocal  (0.28  mr-1) 
corresponds  approximately  to  the  value 
of  1/a  at  which  the  M  =  IX  curve  in 
Fig.  4  bends  upward.  As  M  is  made 
larger  the  angular  size  in  object 
space  at  which  the  image  of  the  target 
on  the  retina  exceeds  3.7  mr  is  shifted 
to  smaller  values,  i.e.,  the  value  of 
1/a-,  with  reference  to  object  space  in¬ 
creases  in  proportion  to  M.  This  is 
apparent  in  Fig.  4  by  comparing  the 
angles  at  which  the  upward  bend  in  the 
curves  occurs. 


The  explanation  for  the  dependence 
of  performance  (contrast  sensitivity) 
of  the  low  light  level  television 
system  on  subjective  magnification  M 
at  large  target  angular  sizes  as  shown 
in  Fig.  4  is  that  the  area  of  the 
image  of  the  target  on  the  retina  is 
greater  than  the  maximum  area  over 
which  the  visual  system  can  integrate. 
Rigorous  analysis  (Schnitzler,  1973) 
of  the  Blackwell  disc  detection  data 


Figure  4.  Effect  of  Varying  Subjective 
Magnification  upon  Log  Noise  Required 
Input  Contrast  versus  Log  Reciprocal 
Target  Angular  Size 

At  large  angular  sizes  such  that 
the  area  of  the  image  of  the  target  on 
the  retina  exceeds  the  maximum  inte¬ 
gration  area  (1/a  <  l/a1  in  object 

space)  an  increase  in  M  results  in  a 
smaller  fraction  of  the  target  image 
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filling  the  maximum  integration  area 
on  the  retina.  Thus,  the  number  of 
statistical  units  due  to  the  differ¬ 
ential  radiance  of  the  target  relative 
to  the  background  and  the  number  of 
statistical  units  due  to  the  background 
radiance  which  are  imaged  on  the  inte¬ 
gration  area  both  decrease  in  proportion 
to  M2.  Since  the  signal  and  noise  are 
proportional  to  the  differential  number 
of  statistical  units  and  the  square 
root  of  the  number  of  background  sta¬ 
tistical  units,  respectively,  they  are 
proportional  to  1/M2  and  1/M,  respec¬ 
tively.  Hence  the  S/N  ratio  is  pro¬ 
portional  to  1/M  and  the  noise  required 
input  contrast  at  large  values  of 
target  angular  size  is  proportional  to 
M. 


The  explanation  for  the  dependence 
of  performance  on  subjective  magnifi¬ 
cation  shown  in  Fig.  4  at  small  target 
angular  sizes  is  that  spatial  dispersion 
determines  a  minimum  image  size  on  the 
retina.  In  this  case  it  can  be  shown 
(Schnitzler,  1973)  that  the_noise  re¬ 
quired  input  contrast,  CL^CE^M)  in 

Eq.  (28),  of  the  unaided  visual  system 
viewing  a  screen  at  the  display  lumi¬ 
nance  Ep  is  proportional  to  1/M2a  . 

Since  it  was  shown  by  Eq.  (28)that_the 
noise  required  input  contrast  C-j-^(E)  of 

the  system  is  proportional  to  MC^N(E^,M), 

we  have  C^(E)  ~  1/Ma2.  If  a  particular 

target  of  small  angular  size  and  some 
known  contrast  is  considered,  then 
1/a  ~  M^.  Since  1/a  is  proportional  to 
range,  the  detection  range 

k 

portional  to  M  . 

If  the  equations  developed  for  the 
foveal  imaging  probability,  the  search 
time  and  the  foveal  detection  probability 
are  considered  together,  the  dependence 
of  performance  on  system  parameters  can 
be  examined.  One  parameter  which  has 
received  considerable  attention  recent¬ 
ly  is  the  size  of  the  display. 


Rp  is  pro¬ 


solid  angle  Q  subtended  at  the  eye  by 
the  display,  i.e.,  t  ~  Q.  Since  it 
was  shown  above  that  ~  M^  and  if 

only  the  display  size  is  variable,  then 
Rp  ~  0^.  Thus,  if  the  display  size  is 

variable,  the  search  time  is  proportion¬ 
al  to  the  fourth  power  of  the  detection 
range.  For  an  11.3  in  wide  display 
viewed  from  24  in  the  search  time 
would  be  5  sec  even  if  the  entire  7 
degree  diameter  of  the  fovea  were 
effective.  Hence,  one  should  be  ex¬ 
ceedingly  cautious  about  increasing 
the  display  size  in  order  to  obtain 
greater  detection  range  unless  random 
search  can  be  avoided  by  prebriefing. 

If  greater  detection  range  is  required, 
the  necessary  subjective  magnification 
can  be  obtained  by  decreasing  the  field 
of  view  in  object  space. 


To  design  a  background  limited 
electrooptical  imaging  system  to  be 
used  in  random  search  for  low  contrast 
targets,  the  first  step  is  to  choose 
as  large  a  display  as  is  consistent 
with  allowable  search  time.  Then  make 
a  trade-off  between  the  desired  de¬ 
tection  range  and  the  field  of  view 
in  object  space  required  to  achieve 
the  necessary  subjective  magnifica¬ 
tion  for  detection.  If  it  is  neces¬ 
sary  to  reduce  the  field  of  view  in 
order  to  achieve  sufficient  subjective 
magnification  for  a  required  detec¬ 
tion  range,  then  it  should  be  accom¬ 
plished  so  as  to  avoid  degradation  of 
the  modulation  transfer  function  of 
the  electrooptical  system  by  choosing 
a  longer  focal  length  objective 
rather  than  by  reducing  the  area  of 
the  primary  sensor.  If  a  required 
field  of  view  and  detection  range 
cannot  be  simultaneously  achieved, 
then  a  larger  objective  aperture  must 
be  chosen. 
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SIGNAL-TO-NOISE  RATIO  THRESHOLDS  FOR  IMAGE  DETECTION 
RECOGNITION  AND  IDENTIFICATION 

Frederick  A.  Rosell  and  Robert  H.  Willson 
Westinghouse' Electric  Corporation 
Defense  and  Electronic  Systems  Center 
Baltimore,  Maryland  21203 

Image  signal-to-noise  ratio  models  are  hypothesized  for  the  detection,  recognition 
and  identification  of  scene  objects  using  an  equivalent  bar  pattern  approach.  The 
extent  to  which  the  models  apply  will  be  discussed  and  the  observer’s  threshold  signal- 
to-noise  ratio  requirements  are  determined  for  various  levels  of  object  discrimination 


through  psychophysical  experimentation. 

INTRODUCTION 

During  the  past  few  decades,  a 
theory  of  imaging,  based  on  the  image 
signal-to-noise  ratio  concept,  has  been 
evolving  through  the  efforts  of  (Barnes, 
1932;  DeVries,  1943;  Rose,  1948;  Shade, 
1967;  Rosell,  1969,  etc.).  Most  of  the 
effort  has  been  devoted  to  determining 
an  observer's  ability  to  detect  images  of 
simple  geometry  and,  rightfully  so; 
simple  images  can  be  easily  generated 
and  can  be  mathematically  described  in 
both  the  space  and  the  Fourier  spatial 
frequency  domain.  By  simple  images,  it  is 
implied  that  the  test  images  are  of 
regular  shape  such  as  squares,  rectangles 
or  disks,  or  patterns  which  are  periodic 
in  one  direction  such  as  trains  of  sine 
or  square  waves  and  aperiodic  in  the 
other.  In  nearly  all  cases,  the  image 
backgrounds  were  uniform.  During  the 
past  few  years,  the  authors  have  attempt¬ 
ed  to  further  verify  and  to  determine  the 
limitations  of  the  current  theory  and  to 
apply  the  re  silts  to  real  imaging  systems 
(Rosell  and  Willson,  1971,  1972) .  More 
recently,  we  have  been  attempting  to 
extend  the  theory  to  more  complex  images 
such  as  may  be  encountered  in  aerial 
reconnaissance.  A  further  goal  is  to 
correlate  the  detectability  of  the  simple 
geometric  images  with  more  complex  levels 
of  image  discrimination .  This  is  of 
value  since  the  simple  images  now  used  in 
the  laboratory  evaluation  of  sensors  can 


be  used  as  an  indicator  of  the  level  of 
performance  to  be  expected  in  field  use. 

In  the  case  of  simple  imagery,  it  is 
sufficient  that  the  observer  "detect"  the 
image  or  "resolve"  a  bar  pattern  structure. 
With  more  complex  images,  the  observer 
may  be  required  to  perform  a  discrimina¬ 
tion  task  such  as  differentiating  between 
two  classes  of  vehicles  or  identifying 
a  particular  type  of  aircraft.  In  this 
paper,  we  will  briefly  describe  the  ele¬ 
mentary  theory,  review  a  small  number  of 
the  experiments  designed  to  verify  the 
elementary  theory  and  discuss  in  somewhat 
more  detail  the  recent  efforts  to  extend 
the  theory  to  real  imagery.  Specifically, 
we  will  be  concerned  with  levels  of  scene 
object  discrimination  ranging  from  the  low¬ 
est  level  of  detection  to  the  more  complex 
levels  of  recognition  and  identification. 

THE  ELEMENTARY  THEORY 

The  elementary  theory  regarding  the 
detection  of  simple  aperiodic  and  periodic 
images  has  been  published  previously  (Ro¬ 
sell  and  Willson,  1970)  and  will  be  further 
developed  and  reviewed  by  R .  Sendall  in 
these  Proceedings.  For  the  present  pur¬ 
pose,  we  note  one  of  the  important  results 
which  relates  the  signal-to-noise  ratio, 
SNRpj,  of  the  image  as  it  appears  on  the 
display  of  a  television  sensor  to  the 
video  signal-to-noise  ratio,  SNRy,  as 
measured  in  the  sensor's  electrical 
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channel.  This  relation  is 

SNRjjj  =  [2tAfv(f )  ]*  •  SNR^  (l) 

where  t  is  an  integration  time  in  sec., 

Afy  is  the  video  bandwidth  in  Hz.,  and  a  is 
the  image  area  relative  to  the  total  effec¬ 
tive  photosurface  area  on  which  a  is 
imaged.  This  relationship  holds  only  for 
images  which  are  unlimited  by  the  sensor's 
spatial  frequency  response,  of  uniform  in¬ 
tensity  amplitude  and  for  noise  spectrum 
which  is  essentially  white  in  character. 

The  above  equation  describes  the 
SNR„  obtainable  from  the  sensor.  If  now, 
we  could  determine  the  SNR^,  required  by 
the  observer  to  perform  some  simple  task 
such  as  detecting  a  rectangle,  we  could 
determine  some  sensor  performance  limits 
by  matching  that  obtainable  with  that 
required.  To  obtain  the  observer's 
requirements,  we  used  the  experimental  set¬ 
up  shown  in  Fig.  1.  In  this  experiment,  a 


Fig.  1  The  Display  Signal-to-Noise  Ratio 
Experiment . 

signal  pulse  of  rectangular  waveshape  but 
variable  duration  is  electronically  genera¬ 
ted  and  mixed  with  band-limited  white  noise 
of  Guassian  distribution.  The  spatial 
image  displayed  on  the  cathode  ray  tube 
(CRT)  display  is  a  rectangle  which  can 
appear  in  any  of  four  quadrants  (but  always 
in  the  same  position  in  the  quadrant  se¬ 
lected)  .  The  observer  is  asked  to  specify 
the  quadrant  in  which  the  image  is  located 
as  the  video  signal-to-noise  ratio  and  the 
image  locations  are  randomly  varied.  The 
observer  is  asked  to  specify  the  image 
location  whether  he  could  see  it  or  not. 

The  probability  of  detection,  determined 
in  this  manner,  was  then  corrected  for 


chance  using  the  formula  =  (P  -  P  )/ 

(l  -  Pc)  where  P.  is  the  corrected  C 
probability,  P  is  the  raw  probability 
data  and  P  is°the  probability  due  to 
chance  (0.25  in  the  case  cited).  Two 
noise  bandwidths  were  used,  7*1  and  12.5 
MHz  and  the  observation  times  per  trial 
were  usually  10  seconds.  The  observer 
distance  from  the  8"  high  picture  dis¬ 
played  was  28"  unless  otherwise  specified 
and  the  display  background  luminance  was 
either  0.2  -  0.3  or  1  ft-Lambert.  The 
television  monitor  was  operated  at  30 
frames  per  second  with  a  525  line  scan  in 
the  vertical. 

For  the  rectangle  experiments,  it 

was  found  to  be  convenient  to  define  the 

image  size  in  terms  of  the  dimensions  of 

a  single  scan  line.  Thus,  we  define  the 

quantities  L  and  L  as 
x  y 

L  L  =  (490) 2  <r( a/A)  ,  (2) 

where  490  is  the  number  of  active  lines 
in  a  conventional  525  line  television 
display  and  a  is  the  width-to-height  pic¬ 
ture  aspect  ratio  of  the  total  effective 
picture  on  the  CRT.  Combining  Eqs.  (l) 
and  (2),  we  have 

SNHjjj  -  (1/490)  •  (2LxLytAfv/<y)^  •  SNRy.  (3) 

This  is  the  equation  used  to  calculate 
the  SNRjyr  **or  the  rectangular  images  used 
in  the  experiments  below.  The  numerical 
values  used  were  t  =  0.1  sec  and  a  —  4/3- 

For  the  experiment,  we  hypothesized 
that  the  SNR~^  required  to  liminally* 
detect  a  stationary  rectangular  image  of 
variable  length  would  be  a  constant  inde¬ 
pendent  of  the  image's  area.  To  test 
this  notion,  we  measured  probability  of 
detection  vs  SNRy  and  SNR^  for  rectangles 
of  size  4x4,  4  x  64,  4  xJT28  and  4  x 
180  scan  lines.  The  results,  plotted  in 
the  form  of  probability  of  detection  vs 
video  SNR  are  shown  in  Fig.  2  where  it  is 
seen  that  the  larger  the  image  length, 
the  smaller  the  SNR^  required  for  a  given 
level  of  probability.  When  the  probabili¬ 
ty  of  detection  is  plotted  vs  the  display 
signal-to-noise  ratio  as  calculated  using 

*  For  liminal  detection,  we  mean  a  50$, 

or  threshold,  probability  of  detection. 
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Fig.  2  Corrected  Probability  of 

Detection  vs  Video  Signal- 
to-Noise  Ratio  for  Rectangular 
Images  of  Size  Olxl,  o  1  x  64, 
A  1  x  128  and  O  1  x  180  Scan 
Lines. 


Eq.  (3)  and  shown  in  Fig.  3,  the  data 
collapses  to  a  single  curve  confirming 
the  original  hypothesis.  These  results 
hold  for  a  wide  range  of  rectangles  so 
long  as  not  more  than  one  of  the  rectan¬ 
gle's  dimensions  exceeds  about  l/2  rela¬ 
tive  to  the  observer's  eye.  For  a  more 
detailed  discussion  of  this  experiment  an 
and  many  others,  the  reader  is  referred 
to  (Rosell,  1972).  The  above  results 
represent  800  observations  by  5  observers. 
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Fig.  3  Corrected  Probability  of 

Detection  vs  SNRpj  Required 
for  Rectangular  Images  of  Size 
O  1  x  1,  □  1  x  64,  A  1  x  128  and 
Ol  x  180  Scan  Lines. 


Although  the  equations  above  were 
derived  for  an  isolated  rectangular 
image,  it  is  hypothesized  that  they  also 
apply  to  the  detection  of  bar  and  sine 
wave  patterns  on  the  premise  that  to 
detect  the  presence  of  a  bar  pattern,  the 


observer  must  detect  the  presence  of  a 
single  bar.  However,  the  threshold  signal- 
to-noise  ratio  required  to  detect  a  bar  in 
the  presence  of  a  number  of  bars  may 
differ  from  that  needed  to  detect  an  iso¬ 
lated  bar  on  a  uniform  background  but  will 
not  differ  much  as  we  will  see.  Suppose 
that  a  single  bar  in  a  pattern  is  of  size 
AyA  wide  by  eAyA  long  where  e  is  the  bar 
length-to-^width  ratio  and  Y  is  the  picture 
height .  Note  that  we  use  dimensionless 
size  units  because  of  its  convenience 
when  a  number  of  image  size  changes  occur 
within  a  system.  Also,  for  reasons  which 
will  become  apparent  as  we  progress,  we 
will  prefer  to  use  the  reciprocal  dimen¬ 
sion,  N,  as  given  by 


(4) 


The  units  of  N  are  "lines  per  picture 
height".  Then,  the  bar  image  area  relative 
to  the  total  effective  photocathode  area 
is  equal  to 


A  2 

(*\  _  SA£_  _ 

A  »Y2  a  N2 


(5) 


where  a  is  the  displayed  horizontal  to 
vertical  picture  aspect  ratio  and  N  is 
designated  the  pattern's  spatial  frequency 
in  "lines  per  pi ct lire  height" .  With  this 
result,  Eq.  (3)  becomes 


1  ei 

SNRpj  =  [2tAfv/cv]5  *j-  •  SNRy 


(6) 


To  obtain  the  observer's  requirements 
for  the  periodic  patterns,  we  employed  the 
experimental  set-up  of  Fig.  4. 
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Fig.  4  Experimental  Set-up  for  the  Tele¬ 
vision  Camera  Generated  Imagery. 
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The  periodic  test  images  used  were 
primarily  bar  patterns,  of  various  height- 
to-width  ratios,  spatial  frequencies  and 
numbers  of  bars  in  the  pattern  which  were 
projected  on  the  faceplate  of  a  high 
resolution  lj"  vidicon  operated  at  high¬ 
light  video  signal-to -noise  ratios  of  50:1 
or  better.  The  camera  and  TV  monitor  were 
operated  at  25  frames/second  with  875 
scanning  lines  (825  active).  Band-limited 
white  noise  of  Guassian  distribution  was 
mixed  with  the  camera  generated  signal. 

Both  the  signals  and  noise  were  passed 
through  identical  filters  (noise  equivalent 
bandwidth  of  12.5  MHz)  prior  to  mixing  in 
the  monitor.  The  monitor  luminance  was 
approximately  1  ft.  Lambert  in  most  cases. 

Because  the  vidicon  camera  has  a 
limited  modulation  transfer  function,  the 
displayed  images  were  square  wave  patterns 
at  very  low  spatial  frequencies,  sine  wave 
patterns  at  high  spatial  frequencies  and 
distorted  square  wave  patterns  in  between. 
Generally,  the  peak-to-peak  signal  is 
measured  in  the  laboratory  but  as  Schade 
(1967)  notes,  the  displayed  bar  pattern 
detectability  depends  upon  the  mean 
signal  rather  than  the  peak-to-peak  signal 
modulation.  However,  knowing  the  sensor's 
modulation  transfer  function  |R  (N)|  ,  the 
mean  signal  response,  called  th§  square 
wave  flux  response  Rqi?(N)  can  be  computed 
from 

VN)  -  i  S  _  1  ^  |s0(k»)|  .  (7) 

k  -  1,  3,  5 - 

where  |  R  (kN) |  represents  the  values  of  the 
sensor 1  s°MTF  at  frequencies  kN  where  kN  are 
the  odd  harmonics  of  frequency  N.  For  the 
vidicon  used  in  our  experiments,  the  square 
wave  amplitude,  modulation  transfer  and 
square  wave  flux  responses  are  as  shown  in 

Fig.  5. 

Using  the  square  wave  flux  response 
concept,  we  calculate  the  SNR^-r  from  the 
relation 

-  <£>,  .  (8) 

n 

In  the  above,  Ai  is  the  peak-to-peak  sig¬ 
nal  current  for  &  broad  area  pattern  (uni¬ 
ty  modulation  transfer  function)  and  I  is 
the  rms  noise  that  is  added  to  the  camera 
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Fig.  5  Square  Wave  Amplitude  (- - ), 

Modulation  Transfer  ( - ),  and 

Square  Wave  Flux  (—  •  — )  Functions 
for  the  l£"  Vidicon  used  in  the 
Bar  Pattern,  Recognition  and  Iden¬ 
tification  Experiments. 


generated  image.  Real  cameras,  of  course, 
have  a  response  that  is  a  function  of 
frequency  and  the  value  of  AiQ  in  the 
video  channel  for  square  wave°inputs  be¬ 
comes  Aip-p>  the  peak-to-peak  value  of  the 
video  signal  when  the  frequency  effects 
are  included,  that  is,  we  have 


Aip-P  =  AioRSQ(N)  (9) 

and 


SNRDI  =  [- 


r^]Vfv^  (I) 


p-p 


(10) 


where  Rgp(N)  is  the  value  of  the  flux 
factor  at  N,  R^q(N)  is  the  value  of  the 
square  wave  response  at  N  and  Aip_p  the 
value  of  the  peak-to-peak  signal  corre¬ 
sponding  to  N  as  measured  in  the  output  of 
he  video  channel.  For  calculation 
purposes,  t,  the  integration  time  of  the 
eye  is  taken  to  be  0.1  sec  and  a,  the 
picture  aspect  ratio  is  4/3*  At  low  spa¬ 
tial  frequencies  the  displayed  images 
approach  a  square  wave  while  at  high 
spatial  frequencies,  above  about  500 
lines/picture  height,  the  displayed  images 
were  nearly  pure  sine  waves.  The  main 
point  of  the  above  discussion  is  to  note 
that  the  SNR^  are  calculated  on  the  basis 
of  mean  signal  to  rms  noise  since  we  have 
corrected  the  measured  signal  using  the 
square  wave  amplitude  and  square  wave  flux 
factors . 


For  the  first  series  of  experiments, 
the  test  images  were  bar  patterns  of 
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various  spatial  frequencies  and  5:1 
height-to-width  ratio.  The  observer  was 
required  to  state  whether  or  not  the 
pattern  displayed  was  resolvable  as  the 
images  SNR^  were  randomly  varied. 

Chance  was  not  involved  since  the  pat¬ 
terns  were  always  present  on  the  display. 
A  number  of  bar  patterns  of  various  spa¬ 
tial  frequencies  were  displayed  at  the 
same  time  and  the  observer  was  asked  to 
indicate  the  pattern  of  highest  frequency 
that  was  barely  visible  as  the  signal-to- 
noise  ratio  was  systematically  varied 
from  high  levels  to  low  levels  and  the 
reverse  in  steps  of  1  db.  The  fraction 
of  patterns  resolved  by  this  Mmethod  of 
limits*1  are  plotted  in  Fig.  6  as  a 
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Fig.  7  Threshold  SNR^  vs  Bar  Pattern 

Spatial  Frequency  Obtained  Using 
Two  Different  Experimental  Tech¬ 
niques,  •  Method  of  Limits  and  o 
Methods  of  Random  SNR  Variation. 


I  2  3  4  5  6  7 


Fig.  6  Probability  of  Bar  Pattern  De¬ 
tection  vs  SNRjyj.  for  Bar  Patterns 
of  Spatial  Frequency  o  104,  •  200, 
□  329,  ■  396,  O  4B2,  and  ♦  635 
Lines  per  Picture  Height .  Bar 
Height-to-Width  Ratio  was  5  in 
All  Cases. 
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Fig.  8  Probability  of  Bar  Pattern 

Detection  vs  SNR^  for  a  396  Line 
Bar  Pattern  of  Bar  Length-to- 
Width  Ratio  o  5:1,  0  10:1,  and  • 
20:1. 


function  of  display  signal-to-noise  ratio 
and  the  thresholds  for  50$  probability 
of  detection  are  plotted  in  Fig.  7* 

To  test  the  notion  that  the  eye 
integrates  over  the  total  length  of  a 
bar,  we  employed  bar  patterns  of  various 
length-to-width  ratios.  A  typical  result 
is  shown  in  Fig.  8  for  a  396  line  bar 
pattern.  The  thresholds  for  three 
different  bar  height-to-width  ratios  are 
shown  in  Fig.  9  and  are  seen  to  be 
similar  to  those  shown  in  Fig.  7*  These 
data  were  obtained  using  random  signal- 
to-noise  ratios  rather  than  the  "method 
of  limits"  used  in  the  first  series  of 
measurements.  Either  method  gives 
about  the  same  results.  Since  the 
"method  of  limits"  is  most  often  used  in 
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Fig.  9  Threshold  SNR^  vs  Bar  Pattern 
Spatial  Frequency  for  Three  Bar 
Length-to-Width  Ratios  of  o  5:1, 
o  30:1,  and  •  20:1. 
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lab  evaluations  of  TV  equipments,  it  is 
interesting  to  note  that  the  method  is 
appropriate.  Further  efforts  to  confirm 
the  utility  of  the  "method  of  limits" 
were  made  with  the  results  shown  in 
Fig.  7. 

In  the  above  experiments,  the  bar 
patterns  were  vertically  oriented  with 
their  longitudinal  axes  perpendicular 
to  the  direction  of  scan.  The  results 
with  the  axes  horizontal  are  shown  in 
Fig.  ID.  One  observer  was  used  for  the 


observations  per  experiment. 
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Fig.  10  Threshold  SNR^,.  Required  to 
Recognize  the  Presence  of  • 
Horizontally  Oriented  and  o 
Vertically  Oriented  Bar  Pattern 
vs  Bar  Pattern  Spatial  Frequency. 

experiment.  Also  shown  in  Fig.  10  are 
the  results  for  vertical  bars  for  the 
same  observer  (the  same  results  as  those 
shown  in  Fig.  7)«  As  can  be  seen,  the 
orientation  is  immaterial  at  the  low 
spatial  frequencies  but  the  thresholds 
for  the  high  spatial  frequencies  increase 
with  horizontal  bars.  Indeed,  the  635 
line  pattern  could  only  be  seen  with 
noise  added  to  the  signal. 
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Fig.  11  Threshold  SNR^  vs  Bar  Pattern 

Spatial  Frequency  for  Display-to- 
Observer  Viewing  Distances  of  o 
14",  o  28"  and  •  56". 

LEVELS  OF  DISCRIMINATION 

The  purpose  of  many  electro-optical 
equipments  is  to  augment  an  observer  so 
that  he  can  discern  a  scene  object  at 
longer  range  than  he  could  with  his 
unaided  eye.  These  objects  may  be  seen 
with  greater  or  lesser  clarity  depending 
on  the  object’s  size,  distance,  contrast, 
radiance,  and  the  sensor's  resolving  power. 
Here,  we  are  concerned  with  a  condition 
where  a  scene  object  needs  to  be  discerned 
with  sufficient  clarity  to  serve  some 
intended  function  and,  that  the  observer 
is  highly  motivated  to  do  so.  The  words 
"sufficient  clarity"  should  be  stressed. 

In  one  case,  it  may  be  sufficient  to 
merely  detect  a  blob  such  as  a  channel 
buoy  while  in  other  cases,  much  higher 
acuity  is  needed.  For  example,  it  is  of 
no  use  to  televise  and  record  a  burgulary 
if  the  recording's  acuity  is  insufficient 
to  identify  the  burgular  in  a  court  of  law. 


In  the  next  bar  pattern  experiment, 
viewing  distance  was  varied  in  discrete 
steps  from  14"  to  28"  to  56".  The  bars 
were  vertically  oriented.  The  results 
are  as  shown  in  Fig.  11.  At  the  short 
distances,  the  low  pattern  frequencies 
become  less  detectable  while  the  reverse 
was  true  at  the  long  viewing  distances. 
It  is  clear  from  Fig.  11  that,  for  a 
given  line  number,  there  is  a  viewing 
distance  that  minimizes  threshold  SNR^^.. 

The  above  experiments  involved  5 
observers  on  the  average  and  500  to  2000 


It  is  intuitively  obvious  that  an 
object  may  be  seen  with  greater  or  lesser 
clarity  depending  on  the  object's  size, 
distance,  contrast,  brightness  and  the 
capability  of  the  observer  alone  or  the 
observer  as  augmented  by  an  imaging  device. 
In  planning  a  surveillance  task,  the  maxi¬ 
mum  or  threshold  range  at  which  a  given 
object  can  be  barely  discerned  at  the 
desired  level  of  discrimination  is  clearly 
of  interest .  At  very  long  range,  a  scene 
object  may  appear  first  as  a  blob.  Moving 
closer,  the  object  can  be  discerned  to 
have  some  shape  such  as  a  rectangle . 
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Closer  yet,  the  object  becomes  able  to 
classify  the  object  as  to  type  and 
finally,  to  identify  the  object  positive¬ 
ly.  (Johnson,  1958)  has  arbitrarily  di¬ 
vided  these  levels  of  object  discrimina¬ 
tion  into  k  categories  which  are: 


Table  I 

Levels  of  Ob.iect  Discrimination 


Classification 

of 

Discrimination 

Level  Meaning 


Detection  An  object  is  present . 

Orientation  The  object  is  approxi¬ 

mately  symmetrical  or 
unsymmetrical  and  its 
orientation  may  be  dis¬ 
cerned. 


Recognition  The  class  to  which  the 

object  belongs  may  be 
discerned  (e.g.,  house, 
truck,  man,  etc.). 

Identification  The  target  can  be 

described  to  the  limit 
of  the  observer’s  know¬ 
ledge  (e.g.,  motel, 
pickup  truck,  policemen, 
etc . ) . 


It  is  readily  evident  that  higher 
resolution  is  needed  to  identify  an 
object  than  to  just  detect  it.  To  obtain 
a  quantitative  feel  for  the  problem, 
Johnson  performed  a  series  of  experi¬ 
ments  using  electro-optical  sensors.  In 
these  experiments,  an  attempt  was  made 
to  correlate  the  detectability  of  a 
bar  pattern  of  a  given  spatial  frequency 
with  the  level  of  object  discrimination. 
The  procedure  was  to  increase  the  object 
range  until  it  was  just  barely  detected 
(or  recognized,  etc.).  Then,  a  bar 
pattern  was  placed  in  the  field  of  view 
and  its  spatial  frequency  was  increased 
until  it  could  barely  be  resolved  at  the 
same  range .  The  spatial  frequency  of  the 
pattern  was  specified  in  terms  of  the 
number  of  lines  in  the  pattern  subtended 
by  the  object's  minimum  dimension  as 
illustrated  in  Fig.  12  where  the  object, 
for  the  recognition  case,  subtends  8 
lines . 


Fig .  12  Resolution  Required  per  Minimum 
Object  Dimension  to  Achieve  a 
Given  Level  of  Object  Discrimina¬ 
tion  Expressed  in  Terms  of  an 
Equivalent  Bar  Pattern. 

Johnson's  results,  as  tabulated  below 
are  not  unexpected.  If  the  observer  could 
only  just  resolve  a  coarse  pattern  corre¬ 
sponding  to  2  bars  per  minimum  object 
dimension,  the  level  of  object  discrimina¬ 
tion  was  limited  to  detection.  With  high¬ 
er  acuity,  a  bar  pattern  of  higher  spatial 
frequency  could  be  discerned  and  the  level 
of  object  discrimination  increased  in  turn. 


Table  II 

Johnson's  Criteria  for  the 
Resolution  Required  per  Minimum 
Ob.iect  Dimension  vs  Discrimination  Level 

Resolution 

Discrimination  per  Minimum  Object 

Level  Dimension  (TV  Lines) 


Detection 

2.0 

+  1.0 
-  0.5 

Orientation 

2.8 

+  0.8 
-  0.4 

Recognition 

8.0 

+  1.6 
-  0.4 

I dent if i c at ion 

12.8 

+  3.2 
-  2.8 

This  table  has  been  widely  used  and 
misused  by  systems  designers  from  the  time 
of  its  publication  to  the  present.  The 
misuse  stems  from  the  neglect  of  addition¬ 
al  requirements  imposed  by  Johnson,  to 
wit,  that  the  "signal-to-noise"  ratio  and 
image  contrast  must  also  be  sufficient . 
However,  it  was  not  too  clear  how  these 
quantities  were  to  be  measured  and 
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calculated  and  thus,  the  further  require¬ 
ments  were  neglected  in  many  cases.  How¬ 
ever,  many  competent  designers  did  use 
the  sensor’s  threshold  resolution  vs 
scene  irradiance  curves  in  estimating 
the  level  of  discrimination.  Since  the 
threshold  curves  do  contain  image  sign al¬ 
to-noise  as  a  factor  in  their  measurement, 
estimates  made  on  this  basis  turn  out  to 
be  reasonable  if  not  precise. 

Most  sensors  are  characterized  by  an 
absolute  limiting  resolution.  If  the 
sensor  sensitivity  at  a  given  scene 
irradiance  level  and  object  contrast  is 
sufficient  to  realize  the  limiting  reso¬ 
lution  but  the  resolution  is  still  not 
sufficient  to  perform  the  desired  discri¬ 
mination  task,  further  increases  in  scene 
irradiance  level  will  be  to  no  avail. 

The  only  solution  is  to  move  closer.  On 
the  other  hand,  if  the  resolution  were 
sufficient  at  a  given  irradiance  level,  a 
decrease  in  scene  irradiance  level  could 
cause  the  acuity  of  a  sensor  and  observer 
combination  to  fall  below  the  level  re¬ 
quired  for  a  wanted  level  of  object 
discrimination . 

To  illustrate,  suppose  it  is  desired 
to  discriminate  the  vehicle  of  Fig .  13 . 
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Fig.  13  Object  and  Image  Resolution 

Required  to  Identify  an  Object 
in  Terms  of  the  Equivalent  Bar 
Pattern  and  the  Scene,  Lens  and 
Focal  Plane  Parameters. 

The  minimum  scene  resolution  required  for 
a  vehicle  of  minimum  dimension  w  ft  will 
be  w/k^  ft  where  k^  is  the  number  of 
lines  needed  for  the  wanted  level  of 
discrimination  per  Table  II.  At  range, 

R,  the  sensor-augment ed-observer ' s  angu¬ 
lar  resolution,  A0,  must  then  be  w/k^R 


radians.  The  image  size.  Ay,  will  be 
equal  to  Fj  •  A0  where  F,  is  the  lens 
focal  length.  Suppose  the  primary  sensor, 
upon  which  the  scene  is  imaged  is  a  tele¬ 
vision  camera  tube. 

The  resolution  of  the  TV  camera  tube 
is  measured  by  placing  a  bar  pattern  of 
spatial  frequency  N  lines/picture  height 
into  the  field  of  view  and  determining 
whether  the  test  bar  pattern  can  be  re¬ 
solved  or  not.  If  the  number  of  lines  N, 
which  the  camera  can  resolve  is  equal  to 
or  greater  than  the  reciprocal  dimensions 
N  of  the  object  image,  the  object  can  pre¬ 
sumably  be  resolved  as  well.  For  this 
comparison  to  be  valid,  the  test  bar 
pattern  must  meet  certain  other  conditions 
as  will  be  discussed. 

As  a  numerical  example,  suppose  the 
vehicle  of  Fig.  13  is  of  height  5'  and  is 
to  be  identified  so  that  k^  =  13.  the 
scene  resolution  must  be  about  44"  corre¬ 
sponding  to  an  angular  resolution  A0  of 
64  x  10“^  radians  at  a  range  of  6000  ft . 

For  a  typical  TV  camera  tube  picture 
height  of  15  mm,  and  a  threshold  resolu¬ 
tion  of  250  lines/pict.  ht.,  a  lens  focal 
length,  F^,  of  940  mm  or  37"  will  be 
needed . 

Johnson's  equivalent  bar  pattern  re¬ 
sults  were  held  to  be  valid  if  the  bar 
spacing  is  chosen  on  the  basis  of  Table 
II,  if  the  bar  pattern  contrast  is  the 
same  and,  if  the  image  signal-to-noise 
ratio,  or  SNR,  is  sufficient.  It  might 
then  be  inferred  that  image  contrast, 
resolution  and  SNR  are  independent  quanti¬ 
ties.  However,  we  shall  derive  functional 
relationships  between  them.  Then  Johnson's 
requirements  will  reduce  to  one;*  namely, 
that  an  object  should  be  discriminated  at 
the  desired  level  if  its  signal-to-noise 
ratio  at  the  output  of  the  observer's 
retina  after  processing  and  interpretation 
by  the  brain  is  large  enough.  Obviously, 
the  signal-to-noise  ratio  as  defined  in 
this  manner,  is  not  directly  measurable 
but  can  be  indirectly  measured  through 
psychophysical  experimentation. 

To  show  a  typical  result,  we  will 
describe  a  recognition  experiment .  Four 


*  Using  the  equivalent  bar  pattern  concept 
to  define  resolution. 
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vehicles,  a  tank,  a  van  truck,  a  half 
track  with  antenna  and  a  derrick  bulldozer 
were  televised.  Noise  of  known  amount  was 
added  to  the  imagery  and  the  probability 
of  the  observer  correctly  recognizing  the 
type  of  vehicle  was  determined  as  a  func¬ 
tion  of  the  vehicle  image's  SNR  with  the 
result  shown  in  Fig.  14 .  The  method  of 
determining  the  SNR  will  be  discussed  in 
detail  below.  The  main  point  is  that  it 
appears  to  be  possible  to  associate  a 
signal-to-noise  ratio  with  an  image  of  a 
real  object  based  on  its  dimensions, 
contrast  and  the  sensor  parameters.  We 
see  in  Fig.  14,  that  the  SNR^.  required 
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Fig.  14  Probability  of  Recognition  vs 

SNRp-j.  for  a  0  Tank,  O  Radar  Half 
Track,  □  Van  Truck,  and  •  Derrick 
Bulldozer .  Background  was 
Uniform. 


to  recognize  the  vehicles  with  50 %  proba¬ 
bility  is  about  3*  To  test  Johnson's 
approach,  the  vehicles  were  replaced  by 
equivalent  bar  patterns  of  spacing  1/8  the 
minimum  vehicle  dimension  and  with 
individual  bars  of  length  equal  to  the 
vehicles  length.  The  bar  pattern  imagery 
was  mixed  with  noise  and  probability  of 
bar  pattern  detection  was  determined  as  a 
function  of  image's  SNR  with  the  result 
shown  in  Fig.  15 .  For  $0%  probability  of 
detection,  an  SNRj^.  of  2.8  is  seen  to  be 
needed.  The  correlation  between  the 
recognition  of  the  real  object  and  the 
detection  of  the  equivalent  bar  pattern 
is  seen  to  be  very  good  —  and  perhaps, 
better  than  is  warranted  when  other 
factors  are  considered  as  we  shall  discuss. 


The  above  experiments  were  conducted 
using  a  uniform  white  background.  The  ^ 
vehicle  areas  were  approximately  0.057  in 
on  the  8"  x  10.7"  display  and  subtended 


angles  of  about  0.34°  by  0.68°  at  the 
observer's  eye.  The  vehicle  types  and 
video  SNR  were  randomly  varied;  and  the 
probabilities  of  recognition,  corrected 
for  chance,  were  determined.  The  SNRj^.'s 
for  the  various  images  were  calculated'1  on 
the  basis  of  the  area  of  a  bar  whose 
length  and  width  are  equal  to  the  length 
of  the  vehicle 1  s  image  and  the  width  of 
the  vehicle's  image  divided  by  8.  This  is 
in  accord  with  the  equivalent  bar  pattern 
concept  discussed  above  and  illustrated  in 
Fig.  12.  We  note,  however,  one  difference 
between  the  calculations  for  the  bar 
pattern  and  the  vehicular  image's  SNR~  . 

In  the  case  of  the  vehicular  image,  the 
signal  amplitude  was  measured  from  the 
background  signal  level  which  was  approxi¬ 
mately  constant,  to  the  peak  object 
signal  level.  For  the  "equivalent  bar 
patterns",  the  signal  levels  were  measured 
in  terms  of  the  mean  signal  excursion 
within  the  bar  pattern  area  in  the 
periodic  direction.  Had  the  peak-to- 
peak  excursions  about  the  average  signal 
within  the  vehicle  area  been  used  (when 
the  object  is  imaged  against  a  uniform 
background),  the  threshold  SNR^  would 
have  been  somewhat  lower. 
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Fig.  15  Probability  of  Bar  Pattern  De¬ 
tection  vs  SNRpj.  for  Pattern 
Area  Equal  to  Average  Real  Object 
Area  of  N  =  o  329,  7  Bars;  ■  396, 
7  Bars;  O  482,  9  Bars;  ♦  635,  11 
Bars. 


These  difficulties  result  from  the 
necessity  of  defining  an  image  area  and  a 
signal  excursion  in  order  to  calculate  an 
SNR^  threshold.  In  this  connection,  we 
observe  that  the  criterion  for  bar  pattern 
"recognition"  is  that  the  observer  must  be 
able  to  discern  a  modulation  within  the 
bar  pattern  whereas  for  vehicle  image 
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recognition,  the  vehicle’s  outline  must 
be  discerned.  This  outline  may  have 
periodic  features  but  is  more  likely  to 
be  aperiodic .  While  we  recognize  and 
even  emphasize  these  differences,  we 
do  not  now  have  better  criteria  to 
suggest .  It  is  felt  that  recognition 
predictions  based  on  the  equivalent  bar 
pattern  concept  will  tend  to  be  some¬ 
what  pessimistic  because  the  effect  of  an 
MTF  is  generally  more  severe  on  periodic 
objects  than  on  aperiodic  objects.  It 
is  interesting  to  observe  that  the 
thresholds  measured  for  recognition 
of  various  vehicle  types  are  nearly 
constant  but  that  the  scatter  in  the  data 
increases  as  we  move  from  the  lowest  level 
of  discrimination  (detection)  to  the 
highest  (identification).  An  alternative 
concept  for  recognition  and  identification 
might  be  to  define  some  sub-area  on  the 
image  and  treat  it  as  an  aperiodic  ob¬ 
ject  .  Intuitively,  we  would  expect  that 
range  predictions  made  on  this  basis 
would  be  optimistic  because  sensor  MTF's 
will  tend  to  reduce  the  modulation  of 
image  detail  features  which  are  not  well 
separated  from  one  another. 

Each  object  used  in  the  recognition 
experiments  did  have  a  feature  which 
approached  the  idea  of  an  isolated  object 
feature  such  as  a  boom  arm,  a  turret,  etc. 
It  may  be  that  a  single  feature  is 
characteristic  of  a  specific  object  and  to 
recognize  the  object,  the  characteristic 
feature  must  be  discerned.  In  each  case, 
the  maximum  signal  excursion  in  the 
"characteristic  feature"  was  the  same 
as  the  largest  found  for  the  whole  target, 
or  nearly  so.  Also,  the  width  of  the 
feature  was  about  1/8  the  total  minimum 
width  of  the  object.  A  similar  situation 
was  found  in  the  identification  experi¬ 
ments  where  the  features  needed  were 
closer  to  l/l3  the  object's  minimum  width. 

In  Fig.  16,  we  show  the  signal 
excursions  for  the  tracked  bulldozer  on 
selected  horizontal  lines  from  the  top 
to  the  bottom  of  the  vehicle  with  line  1 
being  just  above  the  object  and  line  17 
just  below.  These  traces  were  taken  on 
every  other  line  from  a  line  selector 
oscilloscope.  The  dominant  features  of 
the  bulldozer  can  be  located  in  the 
traces  of  Fig.  16.  The  boom  arm  is  in 
traces  3  through  6,  the  cab  in  6  through 
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Fig.  16  Waveform  of  Derrick  Half  Track 
Along  the  Horizontal  as  a 
Function  of  Vertical  Position 
in  Scan  Lines. 

8  and  the  bulldozer  blade  in  9  through 
13 .  The  boom  appeared  to  be  the  most 
characteristic  feature  and  the  width  was 
of  the  order  of  l/8  width  of  the  bull¬ 
dozer. 

The  second  recognition  experiment 
was  performed  by  superimposing  a  back¬ 
ground  transparency  over  the  same  vehicle 
transparencies  as  were  used  above.  The 
background  transparency  has  regions  con¬ 
taining  a  road,  a  grassy  field,  and  grass 
among  trees.  In  the  experiment,  a  vehicle 
was  randomly  chosen  and  located  within  one 
of  the  randomly  chosen  background  regions 
noted  above.  In  all,  12  vehicle-back¬ 
ground  combinations  were  shown  at  5  ran¬ 
domly  selected  values.  The  SNR^j 

was  calculated  as  in  the  first  recognition 
experiment .  The  results  using  a  road 
background  are  shown  in  Fig.  17 .  The 
spread  in  the  data  was  small  and  a 
threshold  value  of  SNE^  is  3*8  which  is 
only  15$  higher  than  that  noted  for  a 
uniform  background.  The  results  with 
grass  background  are  shown  in  Fig .  18  and 
give  a  threshold  value  of  SNE^  of  4*1; 
a  value  that  is  24$  higher  than  for  a 
uniform  background.  In  Fig.  19,  a 
threshold  value  of  5*0  was  obtained  for 
the  recognition  of  vehicles  in  a  grass 
and  tree  background  which  is  52$  higher 
than  the  uniform  background  case  and  31$ 
higher  than  the  road  background  case  and 
22$  higher  than  the  grass  background  case. 
As  one  would  expect,  increases  in  back¬ 
ground  complexity  increase  the  SNE^ 
thresholds  observed  but  the  increases  are 
not  large. 


78 


Considering  Figs.  14,  17,  IQ  and  19, 
it  is  seen  that,  on  the  average,  the  der¬ 
rick  bulldozer  and  van  truck  have  an  SNR^ 
threshold  that  is  about  26$  lower  than 
that  required  for  the  tank  and  radar 
lalf  track.  By  visually  comparing  these 
objects,  it  appeared  reasonable  that  less 
resolution  would  be  needed  to  recognize 
the  derrick  bulldozer  and  van  truck. 

Assume  that,  in  general,  a  different 
resolution  is  required  to  recognize  each 
object  and  that  if  this  resolution  were 
used  in  the  SNR^  calculation  (instead  of  8) 


Fig.  17  Probability  of  Recognition  vs 

SNRd,  for  a  0  Tank,  O  Radar  Half 
Track,  O  Van  Truck,  and  •  Derrick 
Bulldozer  Against  a  Road  Back¬ 
ground  . 


recognition  is  8  +  2  lines. 
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Fig.  19  Probability  of  Recognition  vs 

SNRdi  for  a  O  Tank,  O  Radar  Half 
TracPc,  G  Van  Truck,  and  •  Derrick 
Bulldozer  Against  a  Grass-Trees 
Background . 

Next,  we  progressed  to  vehicle  iden¬ 
tification  experiments.  The  images  for 
this  experiment  were  transparencies  of 
the  M47  Patton,  the  M48  Centurion,  the 
Panther  and  the  Stalin  tanks  as  shown  in 
Fig.  20.  As  for  the  previous  vehicle 
recognition  case,  the  photographs  were 
taken  to  show  the  sides  and  £ops  of  the 
vehicles  from  an  angle  of  45  from  the 
horizontal.  Two  image  sizes  were  used. 

In  the  first  experiment,  the  average 
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Fig.  18  Probability  of  Recognition  vs 

SNRp  for  a  O  Tank,  O  Radar  Half 
Track,  o  Van  Truck,  and  •  Derrick 
Bulldozer  Against  a  Grass  Back¬ 
ground  . 

that  the  value  of  SNR^  would  be  the  same 
for  the  various  images.  The  spread  in  the 
measured  data,  using  the  value  of  8,  should 
correspond  to  the  spread  in  the  required 
resolution  and,  from  the  spread  we  have 
that  the  required  resolution  for  target 


Fig .  20  Photographs  of  Tank  Models  Used 
in  Identification  Experiments. 
Upper  Left  -  M47,  Upper  Right  - 
M48,  Center  Left,  -  Stalin,  Center 
Right  -  Panther,  Lower  Center  - 
Centurion. 

image  on  the  8n  x  10.7“  display  was  0.9 
in^  and  subtended  1.3°  x  2.6°  at  the 
observer's  eye;  and  in  the  second  experi¬ 
ment,  the  image  was  about  2.2  in^  and  the 
angular  extent  was  about  2°  x  4°. 
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The  objects  and  SNR  values  were  randomly 
varied  and  the  probabilities  of  correct 
identification  were  corrected  for  chance. 
For  the  calculation  of  SNR^,  the  image 
area  was  assumed  to  be  a  bar  whose  length 
and  width  are  equal  to  the  length  of  the 
vehicles  image  and  the  width  of  the  ve¬ 
hicle’s  image  divided  by  13  using  the 
equivalent  bar  pattern  concept  discussed 
and  illustrated  in  connection  with  Fig. 

12.  The  results  of  the  first  identifica¬ 
tion  experiment  are  shown  in  Fig.  21  where 
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Fig.  21  Probability  of  Identification  vs 
SNRnj  for  Tanks  of  Angular  Extent 
1.3tt  x  2.6°  Relative  to  the  Eye 
Against  a  Uniform  Background. 
Tanks  were  0  M47,  O  Centurion, 

■  Panther,  and  O  Stalin. 


we  obtain  a  threshold  SNR^  of  5.2.  This 
is  nearly  identical  to  the  value  obtained 
in  a  separate  experiment  for  an  equivalent 
bar  pattern  which  had  13  bars  with  a 
length-to-width  ratio  of  26  and  a  spatial 
frequency  of  137  lines/pict .  ht . 

In  the  second  identification  experi¬ 
ment,  we  obtained  a  threshold  value  of 
6.8  as  shown  in  Fig.  22.  An  estimate  of 
6.2  was  obtained  for  an  equivalent  bar 
pattern  of  26:1  length-to-width  ratio  and 
a  spatial  frequency  of  101  lines/pict.  ht . 
From  the  Figs.  21  and  22,  it  is  seen  that, 
on  the  average,  the  Panther  and  Stalin 
tanks  are  seen  with  an  SNR^,  at  threshold, 
which  is  about  42$  smaller  than  that 
required  for  the  M48  and  M47»  From  the 
photographs  of  the  tanks  in  Fig.  20,  it  is 
seen  that  the  Stalin  and  Panther  tanks 
appear  more  distinctive  than  the  M47  and 
M48  so  that  the  lower  threshold  SNR^  is 
not  entirely  unexpected. 


Fig.  22  Probability  of  Identification  vs 
SNRDj  for  Tanks  of  Angular  Extent 
2°  x  4°  Relative  to  the  Eye 
Against  a  Uniform  Background. 

Tanks  were  O  M47,  o  Centurion, 

■  Panther,  and  O  Stalin. 

In  summary,  it  was  hypothesized  that 
the  recognizability  and  identifiability 
of  "real  world"  objects  could  be  corre¬ 
lated  with  the  discemability  of  an 
"equivalent  bar  pattern.11  The  "equivalent 
bar  pattern"  was  defined  (in  image  space) 
in  terms  of  a  single  bar  in  the  pattern  of 
length  equal  to  the  length  of  the  image 
of  the  "real  world"  object  and  width  equal 
to  the  real  world  image  1 s  width  divided 
by  k, .  The  factor,  k,,  is  a  number 
associated  with  a  given  discrimination 
level,  e.g.,  8  for  recognition.  When  the 
bar  pattern  is  defined  in  the  above  manner, 
it  was  found  experimentally  that  the  SNR^j 
required  to  liminally  discern  the  equiva¬ 
lent  bar  pattern  was  very  nearly  equal  to 
the  SNRpj  required  to  recognize  the  ’’real 
world"  image  when  its  SNR^j  is  calculated 
on  the  basis  of  its  area  divided  by  k^. 

It  might  be  thought  that  defining  thea 
area  of  the  bar  in  the  equivalent  bar 
pattern  as  1/8  the  area  of  the  "real 
world"  image's  area  and  then  calculating 
the  SNRqj  for  the  real  world  image  on  the 
basis  of  1/8  its  area  is  redundant  but 
this  is  not  so.  The  key  reason  for  defin¬ 
ing  the  bar  pattern  in  the  manner  described 
is  to  define  the  required  level  of  sensor 
resolution  in  terms  of  a  spatial  frequency. 
This  permits  us  to  take  into  account  the 
effects  of  finite  sensor  apertures.  As  we 
noted  previously,  Johnson  specified  that 
an  image,  to  be  discerned  at  a  given  level 
of  discrimination  must  be  both  resolved 
and  have  a  sufficient  SNR.  In  the  above 
formulation,  using  the  equivalent  bar 
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pattern  approach,  SNRDj  is  a  function  of 
resolution  so  that  if  the  SNRDj  is 
sufficient,  the  resolution  will  be  suffi¬ 
cient  as  well.  The  threshold  SNR^j's  re¬ 
quired  for  the  various  levels  of  discri¬ 
mination  are  estimated  in  Table  III. 

These  thresholds  apply  to  a  50 %  probabili¬ 
ty  but  can  be  adjusted  for  any  other  level 
of  probability  by  use  of  Fig .  23 . 
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Fig .  23  Probability  vs  Normalized  SNR^  • 
For  Any  Probability  Value, 

Obtain  SNRj.  From  Table  III  for 
50$  Probability .  Find  Value  of 
k  for  Desired  Probability  and 
Multiply  Value  of  SNRj)j  by  k  to 
Obtain  New  Value  of  SNR^^. 
Required . 

We  noted  an  important  caution  in 
connection  with  the  method  of  calculation. 
The  "real  world"  objects  are  not  periodic 
In  general  but  rather,  assemblages  of 
aperiodic  objects  of  different  sizes . 


These  aperiodic  objects  which  make  up  the 
object  may  be  relatively  isolated  as  in 
the  case  of  the  derrick  on  the  tracked 
bulldozer  or  more  clutter-like  as  in  the 
case  of  the  half  track.  Thus,  the 
equi valent  bar  pattern  approach  weights 
cluttered  scenes  more  heavily  and  may 
tend  to  be  somewhat  pessimistic,  particu¬ 
larly  at  high  line  numbers. 
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TABLE  III  Best  Estimate  of  Threshold  SNRDj 


for  Detection,  Recognition  and 
Identification  of  Images  . _ 


*d 

Threshold  SNRp.T 

TV  Lines 

for 

per 

Spatial  Frequency 

Minimum 

(Lines/Pict.  Ht .) 

Discrimination 

Back¬ 

Dimen- 

of 

Level 

ground 

sion 

100 

300 

500 

700 

Detection 

Uniform*  1 

< — 

—  2! 

,8 - 

> 

Detection 

Clutter 

2 

4.8 

2.9 

2.5 

2.5 

Recognition 

Uniform 

8 

4.8 

2.9 

2.5 

2.5 

Recognition 

Clutter 

8 

6.4 

3.9 

3.4 

3.4 

Identification 

Uniform 

13 

5.8 

3.6 

3.0 

3.0 

*  Treated  as  an  Aperiodic  Object. 
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THE  PREDICTION  OF  SIGNAL  STRENGTH  REQUIRED 
FOR  IMAGE  DETECTION /RECOGNITION 
ON  A  RASTER  GENERATED  DISPLAY 

by 

Robert  Sendall 
Electro-Optical  Systems 
Pasadena,  California  91107 

ABSTRACT 

The  theory  of  detection  of  visual  anomolies  on  a  raster  generated  display  is  inde¬ 
pendent  of  the  type  of  sensor  supplying  the  information.  While  recognition  and  there¬ 
fore  detection  in  clutter  requires  training,  the  basic  theory  is  independent  of  the 
sensor.  This  paper  presents  an  analytical  approach  which  can  be  used  with  any  sensor. 

It  is  compatible  with  the  famous  J.  Johnson  criteria  and  is  based  upon  the  theory  that 
the  operator  is  a  spatial  and  temperal  integrator  making  detection  decisions  dependent 
upon  the  perceived  signal- to-noise  ratio. 

The  paper  shows  effects  of  system  component  modulation  transfer  functions  as  they 
affect  both  signal  and  noise.  Both  white  and  colored  displayed  noise  will  be  discussed 
reflecting  the  effects  of  various  possible  noise  source  injection  points.  In  general 
the  perceived  signal-to-noise  ratio  will  be  related  to  the  video  signal-to-noise  ratio 
and  the  probability  of  detection  will  be  related  to  the  perceived  signal-to-noise  ratio. 

Many  discussions  with  F.  Rosell  and  a  few  with  J.  M.  Lloyd  have  aided  in  the  devel¬ 
opment  of  the  analysis  which  is  essentially  an  adaptation  of  the  papers  of  0.  Schade. 


COMPLETE  PAPER  HAS  BEEN  PUBLISHED  UNDER  AFAL-TR-72-374  mE/0  SENSOR 
PERFORMANCE  ANALYSIS  AND  SYNTHESIS  (TV/IR  COMPARISON  STUDY)"  - 
ROBERT  SENDALL  AND  FREDERICK  ROSELL  -  AUTHORS 
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DETECTABILITY  THRESHOLDS  FOR  LINE-SCAN  DISPLAYS 
Robin  L.  Keesee 

Virginia  Polytechnic  Institute  and  State  University 


Blacksburg, 


ABSTRACT 

To  evaluate  the  validity  of  the 
Modulation  Transfer  Function  Area  (MTFA)  as 
a  proposed  unitary  measure  of  TV  image 
quality,  the  visual  thresholds  for  detec¬ 
tion  of  tribar  targets  on  a  video  monitor 
as  functions  of  target  modulation,  target 
spatial  frequency,  white  noise  level,  and 
noise  passband  is  determined  for  three 
representative  TV  systems  having  different 
line  rates  and  bandwidths.  For  these 
three  specific  systems,  optical  measurements 
of  the  displayed  video  noise  measured  in 
luminance  terms,  will  be  compared  with  the 
electrical  rms  measurements  of  noise 
inserted  into  the  video  system.  Implica¬ 
tions  of  these  types  of  measurements  and 
their  importance  in  specifying  the 
displayed  image  quality  will  be  stressed. 

INTRODUCTION 

In  the  ongoing  search  for  a  summary 
measure  of  TV  image  quality,  the  modulation 
transfer  function  area  (MTFA)  has  become 
increasingly  attractive  to  some  researchers 
(Snyder,  1970).  Charman  and  Olin  (1965) 
performed  an  analytical  validation  of  the 
concept  for  photographic  systems,  and  a 
group  at  Boeing  (Borough,  Fallis,  Warnock, 
and  Britt,  1967)  showed  that  there  was  a 
high  correlation  between  MTFA  and  photo- 
interpreters’  opinion  of  photographic 
image  quality.  Although  preliminary 
results  indicated  that  the  MTFA  is  a  valid 
measure  of  TV  image  quality  (Snyder,  1972), 
a  thorough  evaluation  and  practical  utili¬ 
zation  of  the  MTFA  as  a  video  image  quality 
metric  has  been  impeded  by  the  lack  of 
satisfactory  detectability  threshold  data. 
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Conventional  contrast  thresholds  are 
unsuitable  because  they  do  not  include  the 
effects  of  the  sampling  process  of  the 
video  raster.  Coltman  and  Anderson  (1960) 
reported  the  experimental  determination  of 
some  detectability  thresholds  as  verifi¬ 
cation  of  a  theoretical  derivation.  The 
usefulness  of  these  curves  is  perhaps 
suggested  by  quoting  from  their  paper: 

’’...The  amount  of  data  taken 
was  limited,  and  conditions  of 
surround  brightness,  time  interval 
between  tests,  etc.,  were  not 
carefully  controlled,  so  that  the 
data  presented  here  do  not  con¬ 
stitute  a  definitive  study  of  this 
particular  visual  parameter. ...” 

Clearly,  an  experimental  determination  of 
line-scan  detectability  thresholds  is 
needed.  This  paper  describes  an  attempt 
at  filling  this  need. 

The  quality  of  the  MTFA  evaluation  and 
the  extent  of  the  use  of  the  MTFA  as  an 
image  quality  metric  depends  on  the  gener¬ 
ality  of  the  detectability  threshold 
curves.  There  are  several  things  to  be 
considered  in  planning  for  this  generality. 
The  most  obvious  element  in  such  planning 
is  that  the  generality  is  directly  propor¬ 
tional  to  the  number  of  different  detect¬ 
ability  thresholds  determined.  Thus,  the 
threshold  curves  were  found  for  several 
different  system  MTF’s  and  for  a  full  range 
of  spatial  frequencies,  modulations,  and 
noise  levels.  The  effect  of  noise  pass- 
band  on  the  detectability  threshold  curves 
was  also  investigated.  Generality  is 
especially  enhanced  by  describing  quanti¬ 
tatively  the  input  to  the  TV  system  and 
the  system  output.  Further,  the  experiment 
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was  carefully  controlled  so  that  the  input/ 
output  quantification  remained  appropriate 
throughout  the  experiment. 

The  psychophysical  method  of  adjust¬ 
ments  was  used  to  reduce  subject  error  and 
to  help  eliminate  experimenter  bias.  The 
method  of  adjustments  required  the  depen¬ 
dent  variable  to  be  continuous.  Among 
system  bandwidth  and  line  rate,  target 
size,  target  modulation,  noise  level,  and 
noise  passband,  only  noise  level  could 
conveniently  be  made  a  continuous  dependent 
variable.  For  data  stability,  a  large 
number  of  data  points  for  each  combination 
of  values  of  the  independent  variables  was 
needed,  which  dictates  the  use  of  a  few 
highly  practiced  subjects  and  many  trials. 
The  input/output  of  the  video  system. was 
measured  including  camera  lens,  camera, 
etc.,  so  that  the  resulting  methods  of 
measurement  are  generally  applicable  to 
other  systems.  Using  the  complete  video 
chain  precluded  any  electrical  generation 
of  targets.  Further,  since  the  stimulus 
to  the  subject  in  finding  the  detectability 
thresholds  is  light  emitted  from  the  front 
of  the  monitor,  then  the  video  output 
measurements  should  be  photometric. 

DETERMINATION  OF  DETECTABILITY  THRESHOLDS 
Design  of  the  Experiment 

A  summary  of  the  specific  experimental 
design  is  given  in  Figure  1.  Three  systems 
with  different  MTFTs  were  used.  Three 
different  MTFfs  were  obtained  by  operating 
the  closed-circuit  television  (CCTV)  at  32 
MHz  with  a  1225-lines  per  frame  rate,  at 
16  MHz  bandwidth  and  945  lines,  and  at  8 
MHz  bandwidth,  525  lines.  The  square  wave 
responses  corresponding  to  these  three 
MTF's  are  given  in  Figure  2.  Three  or 
four  different  noise  passbands  at  each 
bandwidth/ line  rate  combination  were  in¬ 
cluded,  as  shown  in  Table  1.  The  targets 
were  a  series  of  8  x  10-inch  photographs 
on  the  1951  USAF  tribar  pattern  with  black 
or  gray  bars  against  a  white  background. 

The  bar  or  square-wave  target  was  chosen 
for  simplicity  of  definition  of  the 
subject  response  criterion  and  for  ease  of 
displayed  target  measurement.  The  targets 
were  made  in  seven  spatial  frequencies 
with  eight  modulations  at  each  spatial 
frequency  and  were  displayed  with  the  bars 
perpendicular  to  the  raster  lines.  The 


raster  was  vertical,  as  viewed  by  the 
subj ect . 

NOISE  PASS8AND 
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o  o  o  o  o  o 

s  ^  9  ^  ^  S 
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TRIALS 

Figure  1.  Design  of  the  Experiment 


Figure  2.  Square-Wave  Response  for 
the  Three  Systems 

Subjects  were  three  male  and  two 
female  paid  students  having  a  minimum  of 
20/22  binocular  and  20/25  monocular 
visual  acuity  without  correction  and 
without  any  other  visual  anomalies  as 
tested  with  the  Bausch  &  Lomb  Otho-rater. 
Each  subject  received  six  trials  at  each 
combination  of  spatial  frequency,  modu¬ 
lation,  noise  passband,  and  bandwidth/line 
rate,  for  a  total  across  all  subjects  of 
18,480  data  points. 
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SPATIAL  FREQUENCY 

TV  LINES/INCH,  CRT 


Figure  3.  Video  System 

The  television  system  is  block  dia¬ 
grammed  in  Figure  3.  One  of  the  fifty-six 
targets  was  placed  before  the  Cohu  6100 
camera.  Composite  video  from  the  Cohu 
6900  camera  control  unit  (CCU)  passed 
through  a  35  MHz  mixer  to  a  17-inch  Conrac 
RQA  monitor.  A  General  Radio  1383  Random 
Noise  Generator  produced  a  white  noise  in 
the  20  Hz  to  20  MHz  spectrum.  Narrower 
noise  passbands  were  achieved  by  passing 
this  noise  through  one  or  two  passive 
filters.  Bandwidth  and  line  rate  changes 
were  made  by  changing  components  within 
the  camera  and  the  CCU. 

Components  of  a  Gamma  Scientific 
photometer  were  assembled  in  either  a 
telephotometer  or  microphotometer  mode  to 
make  measurements  at  the  CRT  monitor. 

The  subject,  monitor,  noise  generator, 
noise  filters,  photometer,  and  the  first 
experimenter  were  in  one  room.  In  an 
adjoining  room  were  the  camera,  CCU, 
targets,  and  the  second  experimenter.  An 
intercom  connected  the  two  experimenters. 
Both  of  these  rooms  were  climate  controlled. 

Each  subject  was  seated  so  that  the 
eye-to-CRT  distance  was  40  inches  and  the 
eyes  were  level  with  the  center  of  the 
CRT.  The  noise  generator  was  placed  so 
that  the  noise  level  could  be  adjusted  by 
the  subject  while  the  rms  noise  level 
meter  was  visible  only  to  the  experimenter. 
The  experimental  room  was  dark  except  for 
the  monitor  and  a  small  lamp  for  the 
experimenter.  No  light  from  this  small 
lamp  fell  in  the  subject’s  field  of  view. 

Procedure 

The  normal  daily  experimental  proce¬ 
dure  began  with  allowing  one  hour  of 
warm-up  for  all  of  the  electronics  prior 
to  the  start  of  data  collection.  At  the 
end  of  this  warm-up,  the  system  was 
checked  for  calibration  using  a  gray  scale 


target  in  front  of  the  camera.  Overall 
video  and  blanking  levels  at  the  CCU  were 
checked  and  adjusted  to  constant  levels. 
Composite  video  from  the  CCU  was  monitored 
on  an  oscilloscope  and  any  changes  neces¬ 
sary  in  the  CCU  adjustments  were  made  to 
return  the  video  levels  for  each  gray  bar 
to  set  values.  With  the  electrical  input 
to  the  monitor  thus  normalized,  the  image 
on  the  monitor  of  the  gray  scale  target 
was  viewed  with  the  telephotometer.  The 
luminance  of  three  particular  gray  bars, 
one  near  white,  the  second  middle  gray, 
and  a  third  near  black,  was  measured,  and 
the  contrast  and  brightness  controls  of 
the  monitor  were  adjusted  to  bring  the 
luminance  of  these  bars  within  certain 
tolerances.  This  calibration  procedure 
was  performed  at  the  beginning  of  each 
experimental  session  and  repeated  every 
hour  during  the  session.  As  an  example 
of  the  importance  of  these  procedures,  the 
luminance  of  the  near  white  gray  bar  was 
adjusted  during  calibration  to  be  between 
18.0  and  18.5  footlamberts .  After  an  hour 
of  operation,  the  luminance  would  drift 
one  or  two  footlamberts.  If  this  drift 
had  continued  unchecked,  the  results  would 
have  been  suspect  since  the  stimuli  were 
truly  random.  There  is  no  known  evidence 
to  suggest  that  this  drift  is  peculiar  to 
this  TV  system. 

After  calibration,  a  subject  was 
seated  and  the  seat  height  was  adjusted 
so  that  the  subject’s  eyes  were  in  proper 
position  and  the  subject  was  comfortable. 
The  first  experimenter  requested  a  target 
at  random.  The  second  experimenter  placed 
the  requested  target  before  the  camera. 

The  subject  increased  the  noise  level 
until  he  could  no  longer  determine  that 
there  were  three  separate  bars.  The  first 
experimenter  recorded  this  noise  level, 
increased  the  noise  level  until  well  past 
the  point  where  the  target  was  not 
visible,  and  told  the  subject  to  proceed. 
The  subject  then  reduced  the  noise  level 
until  he  could  just  determine  that  there 
were  three  separate  bars.  This  noise 
level  was  then  recorded,  completing  the 
pair  of  trials.  The  subjects’  criterion 
was  not  that  of  a  detection  task  or  a 
recognition  task;  the  criterion  was  simply 
the  existence  or  non-existence  of  three, 
separate  bars. 
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In  theory,  since  the  ascending  trial 
gave  a  noise  level  slightly  higher  than 
the  actual  threshold  noise  level  and  the 
descending  trial  gave  a  noise  level 
slightly  below  the  threshold,  the  mean  of 
these  two  trials  would  be  the  actual 
threshold.  With  practice,  a  pair  of 
trials  was  completed  in  about  12  seconds. 
Subjects  worked  for  periods  of  20  to  25 
minutes  and  were  then  given  a  five- to- ten- 
minute  break.  Three  of  these  periods  plus 
the  calibration  procedures  filled  a  two- 
hour  experimental  session.  Each  of  the 
five  subjects  worked  each  day  during  the 
experiment  and  for  no  more  than  two  hours. 
This  work  was  demanding  and  any  more  than 
two  hours  per  day  per  subject  caused 
subject  fatigue  and  erratic  performance. 

Results 

Means  of  each  combination  of  spatial 
frequency,  modulation,  noise  passband,  and 
bandwidth/ line  rate  were  calculated  and 
plotted.  To  describe  quantitatively  the 
results,  simple  linear  regressions  were 
calculated  with  modulation  predicting 
threshold  noise  level  at  each  spatial  fre¬ 
quency  and  at  each  of  the  eleven  bandwidth/ 
line  rate-noise  passband  combinations. 
Multiple  linear  regressions  of  spatial 
frequency  and  modulation  predicting  thres¬ 
hold  noise  level  were  also  obtained  at 
each  bandwidth/ line  rate-noise  passband 
combination. 

Figures  4  through  7  show  the  relation¬ 
ship  between  square  wave  modulation  and 
the  threshold  input  noise  level  given  in 
rms  millivolts  for  the  four  noise  passbands 
used  in  the  32  MHz  bandwidth/ 12 25  line 
system. 
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SQUARE  WAVE  MODULATION,  CRT 


Figure  4.  Detectability  Threshold  Means 
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Figure  5.  Detectability  Threshold  Means 
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SQUARE  WAVE  MODULATION,  CRT 


Figure  6.  Detectability  Threshold  Means 


SQUARE  WAVE  MODULATION,  CRT 


Figure  7.  Detectability  Threshold  Means 

Figurr  4  is  for  the  0  to  20  MHz  noise 
passband.  The  relationship  among  the 
parameters  is  as  expected.  At  any  spatial 
frequency,  as  modulation  increases  the 
threshold  rms  noise  also  increases.  As 
spatial  frequency  increases  (target  size 
diminishes),  the  threshold  noise  declines. 
Note  that  for  the  largest  target,  spatial 
frequency  equal  to  5.1  TV  lines/ inch, 
the  threshold  curve  is  nonlinear.  In  the 
segment  where  modulation  is  less  than 
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about  0.3,  any  increase  in  modulation  is 
matched  by  an  increase  in  the  noise  thres¬ 
hold.  Beyond  this  segment,  an  increase  in 
modulation  allows  little  or  no  increase  in 
noise.  A  plateau  of  sorts  has  been 
reached.  For  the  largest  targets  with 
higher  contrast,  most  of  the  subjects  com¬ 
mented  that  the  whole  target  disappeared 
at  the  point  in  increasing  noise  where 
they  could  no  longer  determine  the  exis¬ 
tence  of  three  separate  bars.  In  other 
words,  the  target  was  totally  obliterated 
at  the  noise  threshold.  In  other  targets, 
the  target  remained  visible  as  a  blur  as 
noise  was  increased  above  the  threshold 
even  though  three  separate  bars  could  not 
be  seen. 

Notice  also  in  this  plot,  as  well  as 
those  following,  that  the  maximum  modu¬ 
lation  plotted  at  each  spatial  frequency 
declines  with  increasing  spatial  frequency. 
This  is  an  illustration  of  the  roll-off 
of  the  modulation  transfer  factor  as 
spatial  frequency  increases. 

Although  targets  of  seven  spatial 
frequencies  were  used  at  each  of  the 
eleven  system  combinations,  only  the  lower 
spatial  frequencies  are  shown  in  these 
graphs.  Even  with  a  fair  range  of  modu¬ 
lation  on  the  photographic  prints  of  the 
higher  spatial  frequencies,  the  displayed 
modulation  range  is  quite  small  due  to  the 
TV  system  MTF  roll-off,  yielding  mean 
threshold  curves  of  only  two  or  three 
points.  For  clarity,  these  curves  of 
limited  usefulness  were  not  plotted. 

The  detectability  threshold  curves 
for  the  0  to  5  MHz  noise  passband  given  in 
Figure  5  show  the  same  ordering  of  points 
and  general  shape  as  those  in  Figure  4. 
These  threshold  noise  levels  are  only 
about  half  as  great  as  those  at  the  0  to 
20  MHz  passband.  Since  the  0  to  5  MHz 
band  is  contained  in  both  of  these  noise 
passbands,  it  must  mean  that  the  0  to  5 
MHz  noise  band  contains  the  most  detri¬ 
mental  noise  frequencies.  This  is 
reasonable,  since  the  lower  frequencies 
will  cause  the  larger  ’’snow  flakes.”  The 
energy  expended  in  the  5  to  20  MHz  noise 
frequencies  apparently  has  much  less  effect 
on  target  detectability. 

The  detectability  thresholds  for  the 
3.6  to  5.0  MHz  noise  passband  given  in 


Figure  6  are  slightly  lower  than  those  of 
the  0  to  20  MHz  noise  passband.  If  the 
noise  in  the  3.6  to  5.0  MHz  region  were 
an  important  contributor  to  the  impairment 
of  this  sort  of  target  detection  then  the 
rms  noise  voltages  required  to  reach 
threshold  conditions  would  be  substantially 
less  than  that  rms  noise  required  at  the 
0  to  20  MHz  noise  passband.  But  they  are 
not  substantially  less.  More  important, 
the  threshold  rms  noise  voltage  for  any 
target  is  much  higher  at  3.6  to  5.0  MHz 
noise  passband  than  at  the  0  to  5  MHz 
noise  passband,  indicating  the  important 
noise  frequencies  are  between  0  and  3.6 
MHz. 

Because  noise  in  the  3.6  to  5.0  MHz 
passband  was  relatively  ineffectual  in 
degrading  targets,  then  noise  energy  in 
the  3.6  to  10  MHz  is  even  more  wasted. 

These  thresholds,  shown  in  Figure  7,  are 
the  highest  in  the  32  MHz  bandwidth/1225 
line  group. 


Figure  8.  Detectability  Threshold  Means 


Figure  9.  Detectability  Threshold  Means 
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Figure  10.  Detectability  Threshold  Means 


Figure  11,  Detectability  Threshold  Means 


Figure  12.  Detectability  Threshold  Means 


Figure  13.  Detectability  Threshold  Means 


SQUARE  WAVE  MODULATION,  CRT 

Figure  14.  Detectability  Threshold  Means 

The  detectability  thresholds  for  the 
16  MHz/945  line  and  8  MHz/525  line  systems 
are  presented  in  Figures  8  through  14. 

The  ordering  and  shape  of  the  curves  is 
similar  to  those  of  the  32  MHz/1225  line 
systems.  The  detectability  thresholds  in 
the  8  MHz/525  line  system  with  a  1.9  to 
5.0  MHz  noise  passband  are  considerably 
higher  than  those  for  the  0  to  5  MHz  noise 
passband  at  the  same  bandwidth/line  rate. 
This  result  suggests  that  the  most  detri¬ 
mental  noise  frequencies  are  below  about 
2  MHz. 

At  the  525  line  rate,  on  a  10  in.  x 
14  in.  monitor,  2  MHz  converts  to  about  10 
TV  lines/inch  at  the  monitor.  As  can  be 
seen  in  Figure  1,  this  is  the  point  where 
the  square-wave  response  for  the  8  MHz/ 

525  line  system  starts  to  decline.  One 
might  conclude  that  the  reduced  contri¬ 
bution  of  noise  frequencies  greater  than 
2  MHz  is  due  to  the  system  response  roll¬ 
off  above  2  MHz;  that  is,  frequencies 
greater  than  2  MHz  have  less  amplitude 
due  to  the  monitor  square-wave  response. 
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This  is  only  a  partial  explanation 
of  the  importance  of  low  frequency  noise. 

At  the  1225  line  rate,  the  3.6  to  5.0  MHz 
noise  passband  falls  between  7.5  and  10.0 
TV  lines/inch  spatial  frequency  at  the 
monitor.  The  square-wave  response  for  the 
32  MHz/1225  line  system  is  relatively  flat 
to  about  10  TV  lines/inch  at  the  display 
although  the  square-wave  modulation 
response  is  only  about  .67.  So,  the 
elevation  of  the  noise  thresholds  for  the 
3.6  to  5.0  and  3.6  to  10.0  noise  pass- 
bands  is  due  to  both  the  differential  roll¬ 
off  of  the  higher  frequency  noise  by  the 
system  square-wave  response  and  to  the 
inherent  detriment  of  lower  frequency 
noise,  i.e.,  noise  less  than  about  2  MHz. 
The  rms  noise  scale  changes  with  each 
bandwidth/line  rate  combination. 

Discussion 

Regression.  Simple  and  multiple 
linear  regressions  were  originally  applied 
to  the  data  for  several  reasons.  First, 
a  simple  algebraic  description  of  the 
results  was  sought  to  convert  the  data  to 
a  form  useable  in  MTFA  calculations.  The 
original  data  were  in  the  form  of  threshold 
noise  level  as  a  continuous  variable 
dependent  on  discrete  values  of  spatial 
frequency  and  modulation  for  each  noise 
passband-bandwidth/line  rate  combination. 
For  each  combination,  an  applied  linear 
regression  is  of  the  form 

a  (SF )  +  b(M)  +  c  =  aN  (1) 

where  SF  is  the  spatial  frequency  in  TV 
lines/inch,  M  is  modulation,  is  the 
threshold  noise  level  in  rms  millivolts, 
and  a,  b,  and  c  are  constants  of  the 
regression. 

Also,  the  MTFA  requires  threshold 
curves  relating  spatial  frequency  and  modu¬ 
lation  with  noise  level  at  a  discrete 
parameter.  The  required  algebraic  form 
is 

a  (SF)  +  c  =  M  (2) 

with  equal  to  some  particular  constant. 
Of  course  the  regression  equation  (1)  can 
be  solved  for  modulation  (M)  to  satisfy 
the  MTFA  requirement.  Figure  15  plots, 
as  an  example,  the  multiple  linear 
regression  for  the  32  MHz/1225  line,  0  to 
20  MHz  noise  passband  data. 


Figure  15.  Detectability  Threshold 

Multiple  Linear  Regression 

A  good  question  is  the  legitimacy  of 
the  linear,  rather  than  nonlinear, 
regressions.  Linear  regressions  were 
originally  fit  for  speed  and  simplicity 
As  shown  in  the  summary  of  the  multiple 
linear  regressions,  Table  1,  the  minimum 
correlation  coefficient  is  .90;  at  least 
81%  (100  x  .90^)  of  the  variance  is  pre¬ 
dicted  by  the  linear  equation.  Naturally, 
all  F  tests  for  regression  effect  proved 
highly  significant.  This  result  does  not 
indicate  that  the  linear  model  is  the 
best-fitting  one,  however.  An  F  test  for 
lack  of  fit  applied  to  the  simple  linear 
regression  for  32  MHz/1225  line,  0  to  20 
MHz  noise  passband,  and  spatial  frequency 
equal  to  5.1  TV  lines/inch  was  significant 
(jd  less  than  .001)  indicating  that  the 
linear  model  was  incorrect.  In  a  rash 
attempt  to  find  a  better  model,  a  stepwise 
multiple  regression  was  applied  to  27 
different  transformations  of  spatial  fre¬ 
quency  and  modulation  using  the  32  MHz / 

1225  line,  0  to  20  MHz  noise  passband 
results.  Using  F  =  .01  for  inclusion  and 
F  =  .005  for  deletion,  the  regression 
stopped  after  12  steps  at  a  multiple 
correlation  coefficient  of  .94,  indicating 
that  no  further  increase  of  the  coefficient 
was  possible  by  either  adding  or  deleting 
any  of  the  27  transformations.  Since  the 
multiple  r  for  the  linear  model  is  .92 
(Table  1),  the  improvement  in  percentage 
of  the  variance  predicted  by  the  regression 
is  3%  [(.94)2  =  .88,  (.92)2  =  .85].  Such 
a  small  improvement  does  not  seem  to 
warrant  the  use  of  an  equation  of  twelve 
off-beat  variables  predicting  threshold 
noise.  This  does  suggest  that  12%  of  the 
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data  variance  is  truly  random  and  unpre¬ 
dictable.  On  the  assumption  that  the  models 
underlying  the  other  ten  noise  passband- 
bandwidth/line  rate  combinations  are  of 
similar  form,  the  linear  equations  can  be 
used  with  the  realization  that  they  are 
not  perfect,  but  useful,  very  close 
approximations . 


Grouping  is  especially  evident  in  the  10.7, 
16.0,  and  20.4  TV  line/inch  threshold 
curves  for  the  16  MHz/945-line  system. 

The  equality  of  the  simple  linear  regres¬ 
sions  for  these  threshold  curves  should  be 
treated  further  and,  if  significant, 
explained . 


Af/N 

V 

Af 
_ n 

R 

Least  Squares  Regression  Equation 
N  -  SF  +  M  + 

32  MHz/1225 

0.0 

20.0 

MHz 

.92 

-.45 

235.58 

34.79 

0.0 

- 

5.0 

MHz 

.94 

-.32 

127.22 

22.83 

3.6 

- 

5.0 

MHz 

.93 

-.32 

225.79 

20.08 

3.6 

- 

10.0 

MHz 

.93 

-.48 

343.47 

31.46 

16  MHz/945 

0.0 

- 

5.0 

MHz 

.95 

-.48 

136.88 

29.30 

0.0 

- 

10.0 

MHz 

.90 

-.54 

184.95 

33.29 

3.6 

- 

5.0 

MHz 

.94 

-.43 

237.91 

23.71 

3.6 

- 

10.0 

MHz 

.91 

-.58 

370.03 

29.98 

8  MHz/525 

0.0 

_ 

5.0 

MHz 

.91 

-.56 

190.03 

40.99 

1.9 

- 

5.0 

MHz 

.91 

-.85 

325.51 

44.14 

3.6 

- 

5.0 

MHz 

.91 

-1.17 

337.51 

62.60 

Table  1.  Multiple  Linear  Regression 


Spot  Size  Limitation.  The  order  of 
square-wave  response  functions  for  these 
three  systems  (Figure  2)  is  reversed  from 
expectations,  due  largely  to  the  spot  size 
limitation  of  the  particular  monitor  used 
here.  Although  spot  size  measurements  were 
not  made,  it  is  very  likely  that  the  spot 
size  increased  with  the  increased  beam 
current  used  to  achieve  the  faster  writing 
speed  of  the  higher  line  rates.  This  offers 
a  possible  explanation  for  the  increase  in 
threshold  noise  level  with  decreasing 
bandwidth/ line  rate.  If,  for  the  sake  of 
argument,  the  spot  sizes  for  the  8  MHz/525- 
line  system  and  for  the  32  MHz/1225-line 
system  were  equal,  then  the  impairment  due 
to  a  given  amount  of  inserted  noise  would 
be  greater  in  the  32  MHz/1225-line  system 
since  the  greater  writing  speed  results  in 
a  larger  "snow  flake"  for  any  positive 
noise  pulse.  This  explanation  needs 
empirical  evaluation. 

Threshold  Grouping.  In  many  of  the 
mean  threshold  figures,  it  appears  that 
several  of  the  threshold  curves  coincide. 


SYSTEM  PHOTOMETRY 

Besides  describing  quantitatively  the 
target  modulation  and  spatial  frequency 
put  into  the  system  and  the  displayed 
modulation  and  spatial  frequency  at  the 
monitor,  the  relationship  between  the 
electrical  noise  inserted  into  the  system 
and  the  displayed  luminous  noise  was  also 
sought. 

Input  spatial  frequency  was  physi¬ 
cally  measured  on  the  photographic  prints. 
The  modulation  of  the  targets  on  these 
prints  was  measured  using  a  microphotom¬ 
eter  with  a  25  x  2500  micron  scanning- 
slit  eyepiece.  Spatial  frequencies  of  the 
targets  at  the  display  were  calculated  from 
the  input  spatial  frequency  and  the  system 
magnification.  A  sample  of  targets  was 
measured  at  the  display  to  verify  these 
calculations.  Target  modulation  at  the 
display  was  found  using  the  same  micro¬ 
photometer  and  scanning  eyepiece  as  used 
in  the  input  modulation  measurement  of  the 
photographic  prints.  This  displayed 
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modulation  had  to  be  measured  at  each  dif¬ 
ferent  bandwidth/line  rate  combination 
since  the  MTF's  changed. 

Fortunately,  the  output  rms  voltmeter 
of  the  noise  generator  proved  to  be  accurate 
as  checked  by  a  Ballantine  Model  323-01 
true  rms  voltmeter,  and  its  readings  were 
used  for  the  input  noise  level.  Corrections 
for  attenuation  of  the  noise  passband  were 
applied  in  the  data  analysis. 

Obviously,  the  size  of  the  scanning 
spot  is  also  an  important  parameter  of  any 
line-scan  display.  An  attempt  to  make  this 
measurement  was  made  using  a  high-efficiency 
microphotometer  with  a  double— slit  aperture 
(.003  x  .400  inch  with  .150  inch  spacing  at 
the  CRT).  Theoretically,  the  spot  passing 
the  two  slits  should  give  two  peaks  on  an 
oscilloscope  displaying  the  output  of  the 
photomultiplier.  From  these  two  peaks,  the 
spot  size  can  be  found.  In  this  particular 
CRT,  the  persistence  of  the  P-4  phosphor 
was  so  long  relative  to  the  speed  of  the 
spot  that  only  one  peak  was  obtained  on 
the  oscilloscope  for  each  passing  of  the 
spot  across  the  two  slits. 

This  photomultiplier  tube  output  was 
a  good  measure  of  the  variation  of  the  spot 
luminance  due  to  inserted  noise,  however. 

The  photograph  of  one  of  these  oscilloscope 
images  is  given  in  Figure  16.  The  output 
of  the  photometer  was  taken  directly  from 
the  dynode  of  the  photomultiplier  tube. 
Evidence  to  the  absence  of  saturation  of 
the  photomultiplier  tube  is  given  by  the 
lack  of  limiting  seen  in  the  peaks  in 
Figure  16.  With  the  photomultiplier  tube 
operating  in  its  linear  region  and  the 
results  displayed  on  a  calibrated  Tek¬ 
tronix  7403N  oscilloscope,  the  peak  height 
is  linearly  proportional  to  the  luminance 
of  the  spot  as  it  passes  the  aperture. 

The  standard  deviation  of  the  peak  heights 
when  converted  to  footlamberts,  is  the  rms 
luminance  of  the  spot  and  can  be  compared 
with  the  rms  noise  voltage  inserted. 

The  results  are  given  in  Figures  17 
through  27.  These  graphs  show,  for  each 
bandwid th/line  rate-noise  passband  com¬ 
bination  of  the  threshold  experiment,  the 
relationship  between  the  inserted  noise 
in  rms  volts  and  the  displayed  noise  in 
arbitrary  units  of  rms  luminance.  These 
measurements  were  made  against  both  large 


white  and  black  areas.  The  range  of 
inserted  noise  at  each  system  combination 
was  determined  by  the  approximate  range 
of  the  detectability  thresholds  at  these 
system  combinations. 


Figure  16.  Oscilloscope  Image  of  the  Photo 
multiplier  Output  Measuring 
Displayed  Noise 

Before  further  discussion  of  Figures 
17  through  27,  it  should  be  pointed  out 
that  there  are  data  missing  in  Figures  21, 
22,  and  23.  Some  points  near  zero-inserted 
noise  are  not  plotted  for  simplicity. 

Also,  the  abscissa  scale  is  0  to  .25  rms 
volts  in  Figures  14  through  20  and  0  to 
.50  rms  volts  in  Figures  21  through  27. 

Each  point  in  the  32  MHz/1225-line  system 
combinations  is  based  on  about  36  measured 
peaks,  each  point  in  the  16  MHz/945  line 
plots  is  based  on  about  28  peaks,  and 
about  16  peaks  were  measured  for  each 
plotted  point  in  the  8  MHz/525  line  data. 


Figure  17.  Rms  Luminance  as  a  Function 
of  Inserted  Noise 
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RELATIVE  RMS  LUMINANCE.  CRT  RELATIVE  RMS  LUMINANCE.  CRT 


Figure  18.  Rms  Luminance  as  a 
Function  of  Inserted  Noise 


Figure  19.  Rms  Luminance  as  a 
Function  of  Inserted  Noise 


INSERTED  NOISE,  RMS  VOLTS 
Figure  21.  Rms  Luminance  as  a 
Function  of  Inserted  Noise 


INSERTED  NOISE,  RMS  VOLTS 

Figure  22.  Rms  Luminance  as  a  Function 
of  Inserted  Noise 


Figure  20.  Rms  Luminance  as  a 
Function  of  Inserted  Noise 


Figure  23.  Rms  Luminance  as  a  Function 
of  Inserted  Noise 
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RELATIVE  RMS  LUMINANCE,  CRT  RELATIVE  RMS  LUMINANCE,  CRT  RELATIVE  RMS  LUMINANCE,  CRT 


24.  Rms  Luminance  as  a  Function 
of  Inserted  Noise 


Figure  25.  Rms  Luminance  as  a 
Function  of  Inserted  Noise 


Figure  26.  Rms  Luminance  as  a 
Function  of  Inserted  Noise 


Figure  27.  Rms  Luminance  as  a  Function 
of  Inserted  Noise 


An  inspection  of  these  graphs  indi¬ 
cates  that  application  of  any  linear 
relation  is  inappropriate.  In  most 
cases,  the  curves  are  made  up  of  two 
linear  portions.  For  instance,  in 
Figure  20  (32  MHz/1225  line,  3.6  to  10.0 
MHz  noise  passband) ,  increasing  inserted 
noise  up  to  about  .05  rms  volts  results 
in  a  significant  increase  in  the  ms 
luminance.  After  this  point,  increasing 
inserted  noise  results  in  a  slight  decline 
in  luminance  variation. 


Another  consideration  in  this  inter¬ 
pretation  of  the  results  is  that  the  mean 
luminance,  and  thus  the  modulation,  is 
not  constant  with  increasing  amounts  of 
inserted  noise.  As  the  inserted  noise 
increases,  the  mean  luminance  of  a  white 
area  remains  fairly  stable  but  the  mean 
luminance  of  a  black  area  increases  at  the 
same  rate  as  the  rms  luminance  to  about  the 
point  of  peak  rms  luminance  at  which  time 
both  the  rms  luminance  and  the  mean 
luminance  stabilize. 


The  relationships  among  rms  luminance, 
mean  luminance,  detectability  thresholds, 
and  target  modulation  need  further  study. 
There  may  also  be  an  interaction  here 
between  the  stability  of  the  mean  luminance 
and  rms  luminance  at  higher  noise  levels 
and  the  spot-size-limited  nature  of  this 
monitor. 
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Because  the  purpose  of  this  inves¬ 
tigation  into  the  display  photometry  was 
specification  of  display  parameter  values 
involved  in  finding  the  detectability 
thresholds,  photometric  noise  investigation 
was  limited  and  should  certainly  not  be 
considered  definitive. 

CONCLUSION 

As  expected,  at  any  system  and  noise 
passband  combination,  increasing  modulation 
at  a  particular  spatial  frequency  allowed 
an  increase  in  the  noise  threshold;  and  at 
any  particular  modulation,  increasing 
spatial  frequency  brought  about  a  decrease 
in  the  detectability  threshold. 

The  ordering  of  the  square-wave 
responde  curves  caused  a  corresponding 
ordering  of  the  detectability  thresholds. 
The  lowest  overall  square-wave  response  was 
that  of  the  32  MHz/1225  line  system  and 
the  lowest  overall  noise  detectability 
thresholds  were  also  for  the  32  MHz/1225 
line  system. 

Noise  passband  is  very  important  in 
setting  the  detectability  thresholds. 

The  data  from  the  different  noise  pass- 
bands  indicate  that  lower  frequency 
noise,  less  than  2  MHz,  caused  the 
greatest  increase  in  the  noise  thresholds. 

In  the  Introduction,  it  was  stated 
that  the  principal  objective  of  this  work 
was  generalization  of  the  detectability 
thresholds  to  a  variety  of  systems.  The 
achievement  of  this  objective  is  probably 
somewhat  limited  due  largely  to  the  spot- 
size-limited  display  and,  in  part,  to 
the  number  of  target  spatial  frequencies 
finally  used.  The  degree  of  limitation 
from  the  spot-size  problem  should  be 
investigated.  It  is  apparent  that  the 
scanning  spot  size  and  shape  are  as  im¬ 
portant  in  specifying  line-scan  display 
image  quality  as  are  line  rate  and  band¬ 
width.  It  is  also  interesting  that  the 
square-wave  response  curves  gave  proper 
consideration  to  this  spot-size  deficiency 
in  ranking  the  three  systems. 


Finally,  the  photometry  of  displayed 
noise  reported  here  was  done  only  to 
quantify  the  stimuli  used  to  find  the 
detectability  threshold.  There  is  a 
definite  further  need  to  find  the  relation¬ 
ship  between  electrical  noise  and  actual 
displayed  noise.  Certainly,  it  is  true 
that  research  which  assumes  linearity 
between  inserted  rms  electrical  noise  and 
displayed  rms  luminous  energy  noise  may 
be  in  error,  and  should  be  reevaluated. 
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ABSTRACT 

A  single  measure  of  image  quality,  the 
square-wave  modification  of  the  modulation 
transfer  function  area,  is  shown  to 
correlate  highly  with  observer  performance 
in  extracting  information  from  both  static 
and  dynamic  imagery.  This  paper  describes 
this  unitary  measure  of  image  quality  and 
discusses  the  results  of  the  experiments 
which  establish  its  validity. 

INTRODUCTION 

During  the  past  two  decades,  over  300 
laboratory,  analytical,  and  field  studies 
have  been  performed  to  assess  the  relation¬ 
ship  between  variation  in  line-scan 
display  image  parameters  and  some  measure 
of  observer  performance.  Conclusions 
drawn  from  critical  reviews  of  these 
studies  (e.g.,  Snyder,  et  al.,  1967; 
Hairfield,  1970;  Filleau,  1970)  have  indi¬ 
cated  that  cross-study  comparisons  are 
virtually  impossible.  Variation  in 
specific  system  design  parameters,  or  in 
the  manner  by  which  display  image  quality 
is  synthetically  manipulated,  is  often 
incompletely  controlled,  so  that  con¬ 
comitant  variation  in  the  several  contri¬ 
buting  sources  of  image  quality  results. 

Many  experimental  variables  have  been 
shown  to  have  a  significant  effect  upon 
operator  information  extraction  performance. 
Among  these  are  video  bandwidth,  number  of 
scan  lines,  field  of  view,  signal-to-noise 
level  in  the  video,  frame  integration  time, 
display  contrast,  scene  rate  of  movement, 
and  display  size.  For  the  most  part, 
individual  experiments  have  tended  to 


examine  the  effects  of  one,  two,  and 
sometimes  three  of  these  variables. 
However,  due  to  the  inherent  inter¬ 
action  (nonindependence)  among  these 
variables,  both  in  terms  of  engineering 
design  and  human  performance,  quanti¬ 
tative  combination  of  the  results  is 
hazardous  even  in  the  presence  of  good 
experimental  control  and  measurement. 

In  the  absence  of  such  control,  any  a 
posteriori  attempt  to  combine  the 
results  is  merely  foolish. 

Because  of  these  gross  conflicts 
and  inconsistencies  in  the  experimental 
literature  dealing  with  the  effects  of 
individual  system  parameters,  recent 
efforts  have  been  oriented  toward  the 
development  of  (1)  analytical  expres¬ 
sions  of  overall  image  quality,  such 
as  those  discussed  by  others  here  at 
this  meeting  and  (2)  experimental 
evaluations  of  logically-derived 
summary  measures  of  image  quality.  The 
remainder  of  this  paper  will  discuss 
the  present  approach  and  limitations  of 
data  pertaining  to  one  such  summary 
measure  of  image  quality. 

Any  summary  measure  of  image 
quality,  to  be  useful,  must  be  (1) 
easily  measured  for  existing  imaging 
systems,  (2)  quantitatively  predictable, 
analytically,  for  future  imaging 
systems  at  the  paper  design  stage,  and, 
in  particular,  (3)  highly  correlated 
with  (or  validated  by)  empirically- 
determined  operator  performance  under 
the  dynamic  operational  conditions  of 
interest  for  the  specified  mission. 

To  date,  one  summary  measure  of  image 
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quality  which  shows  promise  for  meeting 
these  criteria  is  the  Modulation  Transfer 
Function  Area. 

Modulation  Transfer  Function  Area  (MTFA) 

Originally  proposed  by  Charman  and 
Olin  (1965)  who  termed  it  the  threshold 
quality  factor,  and  renamed  by  Borough, 
Fallis,  Warnock,  and  Britt  (1967),  the 
MTFA  concept  has  been  evaluated  in  two 
experimental  photointerpretation 
situations  and  demonstrated  to  relate 
strongly  to  the  ability  of  image  inter¬ 
preters  to  obtain  critical  information 
from  reconnaissance  photographic  imagery. 

In  its  original  form,  the  MTFA  was  proposed 
as  a  unitary  measure  of  photographic  image 
quality  which  contains  "the  cumulative 
effect  of  the  various  stages  of  the 
atmosphere-camera-emulsion-development- 
observation  process,  the  ’noise’  introduced 
in  the  perceived  image  by  photographic 
grain,  and  the  limitations  imposed  by  the 
physiological  and  psychological  systems  of 
the  observer"  (Charman  and  Olin,  1965,  p. 
385) .  While  this  measure  was  originally 
developed  for  direct  photographic  systems, 
its  generalization  to  electro-optical 
systems  is  possible  with  some  modifications. 

The  MTFA  is  derived  in  such  a  manner 
as  to  make  use  of  the  modulation  transfer 
function  (MTF)  of  the  imaging  system, 
thereby  retaining  the  analytical  convenience 
of  component  analysis  based  upon  sine-wave 
response  characteristics.  In  addition,  it 
attempts  to  take  into  account  other 
variables  critical  to  the  imaging  and 
interpreting  problem,  such  as  exposure, 
the  characteristic  curve,  granularity,  the 
human  observer  capabilities  and  limitations, 
and  the  nature  of  the  interpretation  task. 
For  the  electro-optical  system,  the  first 
three  of  these  variables  can  be  considered 
analogous  to  detector  irradiance  level, 
gamma  (typically  unity),  and  noise, 
respectively. 

Figure  1  shows  that  the  MTFA  is  the 
area  bounded  by  the  imaging  system  MTF 
curve  and  the  detection  threshold  curve  of 
the  total  system,  including  the  eye.  The 
MTF  curve  for  the  imaging  system  is 
obtained  in  the  conventional  manner,  while 
the  detection  threshold  curve  requires 
several  assumptions  regarding  the  human 
operator.  Specifically,  it  is  assumed 


that  the  viewing  conditions  are  optimum, 
and  that  threshold  detection  of  any 
target  in  the  imaged  display  is  a  function 
of  the  target  image  contrast  modulation, 
the  noise  in  the  observer  visual  system, 
and  the  noise  in  the  imaging  system 
exclusive  of  the  observer.  It  should  be 
noted  that  the  crossover  of  the  two  curves 
in  Figure  1  represents  the  limiting 
resolution  of  the  system  for  a  sine-wave 
target . 


SPATIAL  FREQUENCY  - ► 

Figure  1.  Modulation  transfer  function 
area  (MTFA) . 

At  low  spatial  frequencies,  the 
threshold  detection  curve  is  dependent 
upon  the  properties  of  the  human  visual 
system,  as  shown  in  Figure  2.  At  higher 
spatial  frequencies,  the  effect  of  imaging 
system  noise  becomes  important.  For  the 
photographic  case,  this  imaging  system 
noise  is  equivalent  to  granularity.  It  is 
assumed  further  that  the  eye's  contrast 
threshold  is  0.04,  so  that  this  target 
image  contrast  modulation  must  be  realized 
at  the  display  for  the  target  to  be 
detected,  regardless  of  the  contrast 
modulation  of  the  target  object. 

Figure  2  illustrates  the  normalized 
detection  threshold  curve,  which  must  be 
adjusted  both  vertically  and  horizontally 
for  a  specific  set  of  conditions.  The 
curve  is  repositioned  vertically,  by 
moving  it  upward  for  decreases  in  scene 
object  modulation,  and  by  moving  it 
downward  for  increases  in  system  gamma. 

The  detection  threshold  curve  is  also 
moved  horizontally  to  the  left  with  in¬ 
creases  in  film  granularity,  or  noise. 
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Details  of  these  manipulations  are  given 
by  Borough,  et  al.  (1967)  and  Snyder  (in 
press) . 


Figure  2.  Generalized  detection  threshold. 

Following  positioning  of  the  threshold 
detection  curve,  the  MTFA  is  calculated  by: 

"l  M  00 

MTFA  ■ '  -  5roo> dli  (1> 

U  o 

where  -  the  spatial  frequency  at  which 
the  MTF  curve  crosses  the 
detection  threshold  curve 
(limiting  resolution) 

*  the  MTF  value  at  spatial  fre¬ 
quency  N 

Mt (N)  *  the  normalized  threshold  value, 
as  shown  on  Figure  2 

M  (N)  -  the  inherent  object  contrast 
modulation 

It  should  be  noted  that  the  detection 
threshold  curve,  as  described  here,  is 
akin  to  such  concepts  as  contrast  sensi¬ 
tivity  (Campbell  and  Green,  1965),  sine- 
wave  response  (DePalma  and  Lowry,  1962; 
Lowry  and  DePalma,  1961;  Bryngdahl,  1966), 
and  demand  modulation  function. 

Evaluation  of  the  MTFA  for  Photographic 

Imagery 

To  date,  two  empirical  evaluations  of 
the  MTFA  concept  have  been  conducted,  both 
using  photographic  imagery.  In  the  first 
study,  an  attempt  was  made  to  relate  the 


MTFA  to  subjective  estimates  of  image 
quality  obtained  from  a  large  number  of 
trained  image  interpreters.  In  the  second 
of  these  experiments,  actual  information- 
extraction  performance  data  were  obtained, 
as  well  as  subjective  estimates  of  image 
quality,  and  both  measures  were  compared 
with  the  MTFA  values  of  the  imagery. 

In  the  Borough,  et  al.  (1967)  experi¬ 
ment  to  evaluate  MTFA,  the  purpose  was  to 
determine  whether  a  strong  relationship 
existed  between  MTFA  and  subjective  image 
quality.  This  limited  evaluation  was 
imposed  simply  to  reduce  data  collection 
costs  in  the  event  that  the  MTFA  measure 
proved  fruitless.  Nine  photographic 
reconnaissance  negatives  were  used  as  the 
basis  for  laboratory-controlled  manipu¬ 
lation  of  image  quality.  Each  of  the 
scenes  was  printed  in  32  different  MTFA 
variants,  determined  by  four  different 
MTF's,  three  levels  of  granularity,  and 
three  levels  of  contrast,  as  illustrated 
in  Figure  3.  Four  cells  of  the  matrix 
were  deleted  because  their  MTFA  values 
corresponded  to  others  in  the  32-cell 
matrix.  The  MTF  curves  are  illustrated  in 
Figure  4. 


Figure  3.  Production  of  MTFA  conditions. 

The  resulting  288  transparencies 
(9  scenes  by  32  variants/scene)  were  used 
in  a  partial  paired-comparison  evaluation 
by  36  experienced  photointerpreters. 

The  subjects  were  asked  to  select  the 
photo  of  each  pair  that  had  the  best 
quality  for  extraction  of  intelligence 
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information.  All  pairs  were  composed  of 
two  variants  of  the  same  scene;  each  sub¬ 
ject  made  a  total  of  256  comparisons,  for 
a  grand  total  for  all  subjects  of  36  X  256 
=  9216  judgements. 
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Figure  4.  Average  MTF  curves  produced  by 
edge-response  method. 

Correlations  were  obtained  between 
the  subjective  image  quality  rating 
(derived  from  the  paired-comparisons)  for 
each  of  the  32  variants  and  several 
physical  measures  of  image  quality.  Most 
important  to  this  discussion  is  the  obtained 
mean  correlation  of  0.92  between  MTFA  and 
subjective  image  quality,  which  indicates 
that  the  MTFA  is  strongly  related  to 
subjective  estimates  of  image  quality. 

The  next  experiment,  by  Klingberg, 
Elworth,  and  Filleau  (1970),  examined  the 
relationship  between  objectively-measured 
information-extraction  performance  and  the 
MTFA  values.  As  a  check  on  the  results  of 
Borough,  et  al.  (1967),  Klingberg,  et  al. 
(1970)  also  obtained  subjective  estimates 
of  image  quality,  so  that  all  three  inter¬ 
correlations  were  evaluated. 

The  imagery  used  for  this  experiment 
was  the  same  as  that  used  by  Borough, 


et  al.  (1967).  A  group  of  384  trained 
military  photointerpreters  served  as 
subjects.  Each  subject  was  given  one 
variant  of  each  of  the  nine  scenes  and 
asked  to  (1)  rank  the  image  on  a  nine-point 
interpretability  scale,  using  utility  of 
image  quality  for  information  extraction 
as  the  criterion,  and  (2)  answer  each  of 
eight  multiple-choice  questions  dealing 
with  the  content  of  the  scene.  The 
interpretability  scale  values  were  used 
to  develop  a  subjective  image  quality 
measure  for  the  288  images,  while  scores 
on  the  multiple-choice  interpretation 
questions  were  used  to  measure  information 
extraction  performance. 

Figure  5  shows  the  scattergram  between 
information  extraction  performance  and 
MTFA  for  the  32  MTFA  values.  The  resulting 
product-moment  correlation,  averaged 
across  the  nine  scenes,  is  -0.93.  (The 
minus  value  is  due  to  the  use  of  number  of 
errors  as  a  measure,  which  is  inversely 
related  to  MTFA.)  Further,  the  obtained 
product-moment  correlation  of  0.96  between 
MTFA  and  subjective  quality  (rank)  agrees 
quite  well  with  the  correlation  previously 
obtained  by  Borough,  et  al.  (1967),  of 
0.92.  The  correlation  between  subjective 
quality  and  the  number  of  errors  is  0.97. 


Figure  5.  Scattergram,  information 

extraction  performance  vs. 

MTFA. 

These  data  show  unequivocally  that  a 
measure  of  image  quality  based  upon  the 
excess  of  MTF  over  the  threshold  detection 
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level  correlates  highly  with  the  ability 
of  observers  to  obtain  critical  operational 
information  from  photographic  imagery. 

The  remaining  question,  of  course,  is 
whether  the  MTFA  concept  can  be  validly 
used  to  predict  typical  observer  perfor¬ 
mance  from  an  electro-optical  line-scan 
display. 

THE  MTFA  AND  LINE-SCAN  DISPLAYS 

The  application  of  the  MTFA  concept  to 
line-scan  sensors  and  displays  is  not 
straightforward,  either  analytically  or 
experimentally.  First,  there  exists  no 
threshold  detection  curve  for  the  line- 
scan  display  situation  in  which  the 
targets  are  sinusoidally  varying  in 
intensity.  Second,  there  is  no  easy  and 
perhaps  valid  derivation  of  the  effects  of 
variables  such  as  the  number  of  active 
scanning  lines,  the  noise  passband,  the 
video  passband,  and  the  target  modulation 
upon  the  detectability  threshold  curve. 
Generalization  from  the  concepts  offered 
by  Charman  and  Olin  (1965)  cannot  be  made 
with  any  confidence.  Finally,  the  Charman 
and  Olin  threshold  curves  imply  optimum 
viewing  by  the  subject,  including  optimum 
display  size  (image  magnification), 
unlimited  viewing  time,  ideal  non-glare 
ambient  illumination,  and  no  annoying 
environmental  conditions  (e.g.,  noise, 
vibration,  etc.). 

For  these  reasons,  our  approach  to 
employing  the  MTFA  concept  has  been  to 
modify  it  as  needed  to  better  fit  the 
line-scan  display  case,  both  conceptually 
and  from  a  measurement  standpoint,  and  to 
determine  empirically  the  appropriate 
threshold  detection  curves  for  our  parti¬ 
cular  viewing  conditions.  R.  L.  Keesee's 
paper,  at  this  Symposium,  will  discuss  the 
methodology  and  results  of  our  experiments 
performed  to  generate  the  threshold  curves, 
while  the  remainder  of  this  paper  will 
present  the  concepts  and  experiments  used 
to  validate  those  concepts. 

Modification  of  the  MTFA  Concept 

The  use  of  a  sine-wave  pattern  to 
generate  both  the  MTF  curve  for  a  given 
system  and  to  determine  detection  thres¬ 
hold  curves  is  quite  difficult,  in  part 
because  it  is  difficult  to  obtain  or 
produce  accurate  sinusoidally-varying 


intensity  patterns  on  hard  copy.  For  this 
reason,  and  also  because  it  is  reasonably 
easy  to  transform  square-wave  response 
data  into  approximately  equivalent  sine- 
wave  response  data,  we  have  been  using 
square-wave  inputs  exclusively.  Our 
experimental  technique  is  summarized  in 
Figure  6.  Simply,  a  variable  parameter 
television  system  is  used  to  view  either 
8xl0-inch  hard  copy  prints  or  a  dynami¬ 
cally-varying  scene  contained  on  35mm 
motion-picture  imagery  which  was  taken 
from  the  North  American  Aviation/Columbus 
terrain  model  at  varying  speeds,  altitudes, 
fields  of  view,  and  depression  angles 
(Snyder,  1967).  System  parameters  such  as 
video  bandwidth,  active  line  number,  con¬ 
trast,  etc.,  can  be  selectively  varied  by 
controls  at  either  the  camera  or  the 
camera  control  unit.  The  noncomposite 
video  is  mixed  with  a  shaped  noise  signal 
by  a  wide-bandpass  video  mixer,  added  to 
the  sync  and  blanking  signals,  and  sent  to 
a  variable-parameter  17-inch  display 
located  40  inches  from  the  subject’s 
nominal  eye  position. 
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Figure  6.  Simulation  system  block 
diagram. 

The  video  signal  is  monitored  at 
several  places  and  displayed  on  a  wide- 
bandwidth  oscilloscope,  the  face  of  which 
is  photographed  for  subsequent  evaluation. 
These  test  points  permit  measurement  of 
rms  noise,  peak-to-peak  signal,  signal 
plus  noise  waveform,  and  camera  raw 
video  output. 

All  camera  and  camera  control  unit 
adjustments  are  checked  every  30  minutes 
during  an  experimental  run  to  assure 
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stability  both  electrically  and  photometri¬ 
cally.  The  photometric  quality  of  the  dis¬ 
play  is  measured  with  a  telephotometer  to 
assure  constant  gray-scale  rendition,  using 
an  11-level  gray-scale  chart.  The  photo¬ 
metric  output  of  the  system,  relative  to 
a  square-wave  1951  USAF  tribar  target 
input,  is  measured  photometrically  with  a 
scanning  eyepiece  by  a  microphotometer, 
recording  the  analog  output  on  an  X-Y 
plotter.  Thus,  the  system  square-wave 
response  is  measured  photometrically, 
rather  than  electrically.  As  Keesee’s 
paper  shows,  it  is  not  necessarily  safe  to 
assume  linearity  between  electrical  input 
and  photometric  output  of  a  cathode-ray 
tube.  Our  emphasis  on  the  photometric 
output  is  simply  because  the  visual  stimu¬ 
lus  is  photometric,  not  electrical,  and  we 
are  ultimately  interested  in  measuring 
human  performance. 

Employing  these  techniques,  and  vary¬ 
ing  several  system  parameters,  we  can 
generate  a  variety  of  system  square-wave 
response,  Rgg(N),  curves,  the  experimental 
results  from  which  are  discussed  in  the 
remainder  of  this  paper. 

Evaluation  of  the  Modified  MTFA  for 

Line-Scan  Displays 


In  one  experiment,  five  different 
noise  levels  were  used  with  a  1225-line, 
32-MHz  bandwidth  video  system  configuration. 
The  system  square-wave  response  Ron(N)  and 
the  square-wave  detection  threshold  curves 
are  shown  in  Figure  7.  The  imagery  dis¬ 
played  was  an  air-to-ground  dynamic  search 
scene,  simulating  an  altitude  of  10,000 
feet,  a  groundspeed  of  500  feet  per 
second,  a  camera  depression  angle  of  28 
degrees,  and  a  camera  field  of  view  of  30 
degrees  diagonal  on  a  3:4  (horizontal: 
vertical)  aspect  ratio.  The  video  signal- 
to-noise  levels,  for  a  100%  contrast 
target,  were  32,  9.14,  4.27,  2.91,  and 
2.13  decibels. 

Twenty-eight  tactical  and  strategic 
ground  targets  were  selected,  and  11  dif¬ 
ferent  subjects  were  used  for  each  of  the 
five  noise  levels,  each  subject  searching 
for  each  of  the  28  targets.  The  subjects’ 
performance  was  measured  in  terms  of  the 
likelihood  of  a  correct  response  (number 
of  correct  responses  divided  by  28)  and 
the  calculated  range  to  the  target  at  the 


time  of  a  correct  recognition  response. 
All  targets  were  pre-briefed,  with  the 
subjects  having  studied  pictures  of  the 
targets  prior  to  the  experiment. 


Figure  7.  System  Rgq(N)  and  threshold 
detection  curves  for  five 
noise  levels. 

The  results  of  the  experiment,  sum¬ 
marized,  were  that  the  modified  MTFA 
measure  (MTFAgq)  correlates  0.965 
with  the  likelihood  of  recognition,  and 
0.76  with  the  range  of  recognition. 

In  a  second  experiment,  using  static 
pictures  of  human  faces,  similar  results 
were  found.  In  this  study,  we  were 
interested  in  the  application  of  image 
quality  to  the  recognition  of  a  single 
face,  as  in  a  police-surveillance  task 
from  low-light-level  video  tape  imagery. 
Static  facial  photos  were  placed  before 
the  television  camera,  and  the  subjects 
were  required  to  identify  the  face  on  the 
display  from  a  ’’rogues’  gallery”  set  of 
35  faces.  Three  different  system  config¬ 
urations  were  used,  with  each  configuration 
studied  at  five  different  noise  levels,  as 
illustrated  in  Figures  8  through  10. 

Table  1  summarizes  the  experimental  design 
and  conditions,  indicating  that  each  of 
five  subjects  was  asked  to  recognize  each 
of  the  35  faces,  in  a  random  order,  three 
different  times,  once  for  each  system 
(under  a  different  noise  level) .  Order  of 
presentation  and  selection  of  the  facial 
stimuli  for  each  subject  and  for  each 
condition  were  appropriately  counter¬ 
balanced  . 
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Table  1.  Experimental  design,  facial 
recognition  experiment. 


Figure  8.  System  RSq(N)  and  threshold 

detection  curves,  8/525  system. 
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Figure  9.  System  Rcq (N)  and  threshold 

detec tion  curves ,  16/945  system. 


Figure  10.  System  RSg(N)  and  threshold 
detection  curves,  32/1225 
system. 


The  results  of  this  facial  recog¬ 
nition  experiment  are  shown  in  Figures  11 
and  12.  The  product-moment  correlation 
between  the  fifteen  experimental  con¬ 
ditions1  MTFASq  values  and  the  probability 
of  recognizing  the  face  is  0.69,  although 
the  relationship  is  obviously  nonlinear 
(Figure  11).  Similarly,  Figure  12  shows 
that  the  relationship  between  MTFAgg  and 
response  time  is  orderly,  but  nonlinear, 
with  a  correlation  of  -0.67.  Applying 
transformations  to  each  plot  to  approxi¬ 
mate  linearity,  and  replotting  the  data  as 
Figures  13  and  14,  it  is  shown  that  the 
correlation  between  log-^MTFAgg  and  the 
probability  of  correct  recognition  is 
0.87  (Figure  13)  and  the  correlation 
between  log^gHTFAgg  and  log-^Q  response 
time  is  -0.92  (Figure  14).  Both  corre¬ 
lations  are  statistically  significant  at 
the  0.001  level  of  confidence.  The 
nonlinear  best-fit  equation  is  of  no 
consequence  in  terms  of  systematic  pre¬ 
diction;  it  is  only  important  that  a  high 
correlation  exist  to  validate  the  concept 
of  the  metric. 
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Figure  11.  MTFAgg  vs.  probability  of 

correct  recognition  of  faces. 


109 


PROPORTION  CORRECTLY  RECOGNIZED 


mtfasq 

Figure  12.  MTFAgQ  vs.  response  time  for 
facial  recognition. 
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Figure  13.  MTFASf)  vs.  log^Q  Probabilit:y 
of  correct  recognition. 


Figure  14.  Log1Q  MTFASq  vs.  log10 
response  time. 


CONCLUSION 

In  general,  then,  we  can  conclude 
that  the  revised  metric  based  upon  a 
square-wave  response,  MTFAcq,  is  a  strong 
predictor  of  image  quality  ror  raster-scan 
displays,  just  as  it  is  for  continuous- 
sampled  photographic  imagery.  In  fact, 
under  certain  conditions,  such  as  when 
system  gamma  is  unity  and  the  MTFAcq  is 
transformed  into  an  equivalent  sine-wave 
metric,  the  MTFA  concept  is  very  similar 
to  the  integral  under  the  SNRp  curve  which 
was  discussed  by  Rosell  at  this  Symposium. 
Using  a  periodic  target,  such  as  a  tribar, 
the  two  concepts  permit  prediction  of  the 
human  observers  performance  with  high 
validity.  Rosell* s  studies  of  threshold 
detection  for  static  images,  and  our 
studies  of  recognition  for  both  static  and 
dynamic  images  both  show  that  this  type  of 
metric,  based  upon  the  system  square-wave 
or  sine-wave  response,  relative  to  the 
observer* s  requirement  for  contrast  at  a 
given  spatial  frequency,  are  accurate 
predictors  of  observer  performance.  In 
general,  whatever  system  gives  the  greatest 
excess  of  contrast  (or  modulation)  over 
the  eye*s  requirement  at  the  spatial  fre¬ 
quency  of  interest  will  produce  the  best 
performance. 
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ABSTRACT 

One  of  the  early  attempts  to  provide 
a  quantitative  model  for  predicting  the 
capabilities  of  an  observer  in  looking  for 
pre-brief ed  targets  is  described.  This 
model  is  structured  according  to  three 
distinguishable  psychophysical  processes: 
deliberate  search  over  a  fairly  well-de¬ 
fined  area;  detection  of  contrasts  (a 
subconscious  retino-neural  process) ;  and 
recognition  of  shapes  outlined  by  the  con¬ 
trast  contours  (a  conscious  decision  based 
on  comparison  with  memory) .  The  probabil¬ 
ity  that  the  three  steps  are  completed 
successfully  gives  the  probability  of  tar¬ 
get  detection.  The  foregoing  is  essen¬ 
tially  a  static  model.  Modifications  to 
allow  for  the  dynamics  of  flight  approach¬ 
ing  a  target,  and  for  individual  target 
motion  relative  to  the  background,  are 
also  described. 

INTRODUCTION 

Human  vision  is  achieved  by  a  remark¬ 
able  electro-optical  system  that  has 
evolved  naturally  to  a  form  that  is  far 
more  versatile  than  anything  we  have  been 
able  to  build  ourselves.  Everyone  is  aware 
of  the  two  classes  of  primary  receptors, 
the  efficient  distribution  of  the  high- 
resolution  components,  the  huge  range  of 
luminance  levels  over  which  the  system  can 
function,  and  so  forth. 

There  is  an  additional  feature  of  the 
dynamic  range  capability  that  has  been 
brought  to  light  in  a  little-known  funda¬ 
mental  paper  (Ory,  1969).  Using  modern 
statistical  decision  theory,  Ory  was  able 
to  obtain  an  excellent  fit  to  the  6-sec/8- 
position  Tiffany  data  by  making  the  follow¬ 
ing  assumptions:  primarily,  the  decision 


criterion  is  assumed  to  be  adaptable,  or 
programmed,  and  to  depend  on  both  the 
luminance  (total  number  of  neural  exci¬ 
tations  per  unit  time)  and  its  fluctua¬ 
tions  (square  root  of  number  of  excita¬ 
tions  per  unit  time);  in  addition,  both 
the  background  and  the  target  luminance 
and  fluctuations  are  to  be  included,  and 
the  image  extension  due  to  both  optical 
aberrations  and  the  retinal  neural  organi¬ 
zation  must  be  accounted  for.  With  these 
assumptions,  all  of  the  experimental  data 
on  the  threshold  luminance  (AL02t)  can  be 
plotted  on  two  curves,  one  for  foveal- 
photopic  vision  and  one  for  extrafoveal- 
scotopic  vision.  Each  curve  is  a  simple 
quadratic  in  the  total  fluctuation  rate, 
and  one  of  them  has  a  small  constant 
curve. 

The  interesting  point  here  is  that 
these  curves  define  two  regions,  depend¬ 
ing  on  whether  the  fluctuation  or  the 
luminance  term  is  dominant;  and  these 
correspond  to  the  DeVries-Rose  and  the 
Weber-Fechner  regimes  of  the  literature. 
These  regimes  are  interpreted  as  follows: 
At  low  light  levels,  discriminable  incre¬ 
ments  are  limited  by  the  noise  level, 

AL  ~  and  system  performance  is  deter¬ 
mined  by  the  signal-to-noise  ratio,  AL !\Jh, 
As  the  luminance  is  increased,  the  number 
of  discriminable  levels  rises  as  the 
square  root,  until  another  limit  sets  in — 
the  number,  n,  of  levels  that  our  visual 
equipment  can  process  simultaneously. 

From  this  point  on,  the  discriminable  in¬ 
crements  are  just  the  fraction  1/n  of  the 
maximum  luminance,  AL  ~  L,  and  system  per¬ 
formance  is  limited  by  the  contrast,  AL/L. 

Now,  with  modern  man-made  electro- 
optical  systems,  despite  the  fact  that  the 
display  is  bright  enough  for  the  eye  to 
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operate  photopically ,  there  is  so  much 
noise  in  any  practical  system  that  the  over¬ 
all  performance  is  again  noise-limited.  The 
number  of  distinguishable  grey  levels  on 
some  of  the  old  radar  scopes  was  3  or  4  at 
most,  and  now  we  struggle  to  get  7  or  8; 
but  these  are  still  well  below  the  number 
(probably  around  20)  where  the  human  pro¬ 
cessing  system  imposes  its  contrast  limi¬ 
tation. 

The  conclusion  is  that  signal-to-noise 
is  the  appropriate  parameter  for  analyzing 
the  capabilities  of  an  observer  viewing  an 
electro-optical  display;  but  it  is  still 
appropriate  to  use  contrast  when  you  look 
out  the  window. 

Now  I  will  go  back  and  describe  my  old 
observer  model.  This  was  primarily  intended 
for,  and  is  best  applied  to,  the  contrast 
regimes.  A  description  of  this  model  was 
first  published  in  1970  (Bailey,  1970); 
however,  it  was  developed  during  the  mid¬ 
sixties  and  was  used  internally  at  Rand 
prior  to  its  publication.  This  paper  gives 
an  abridged  description  of  the  model,  em¬ 
phasizing  the  concepts  involved,  together 
with  two  extensions  which  have  been  added 
since  that  time. 

GENERAL  FORM  OF  THE  MODEL 

The  performance  of  a  human  observer  is 
often  a  very  complicated  function  of  many 
interacting  variables.  In  order  to  simpli¬ 
fy  this  difficult  situation  and  yet  stay 
close  to  reality,  we  consider  explicitly 
the  task  of  finding  known  and  fixed  objects 
in  a  complex  field  in  a  short  time.  This 
process,  even  when  so  restricted,  is  still 
complex,  but  it  can  be  considered  to  con¬ 
sist  of  the  following  three  distinct  steps: 
deliberate  search  over  a  fairly  well-de¬ 
fined  area;  detection  of  contrasts  (a  sub¬ 
conscious  retino-neural  process) ;  and 
recognition  of  shapes  outlined  by  the  con¬ 
trast  contours  (a  conscious  decision  based 
on  comparison  with  memory) . 

On  the  basis  of  assorted  experimental 
data,  three  formulas  can  be  devised  for  the 
probabilities  of  completing  each  of  the 
three  steps  separately.  It  is  postulated 
that  the  overall  target  recognition  proba¬ 
bility  can  be  expressed  by  the  product  of 
these  three  terms.  Accordingly,  we  estab¬ 
lish  the  following  definitions: 


•  P^  is  the  probability  that  an 
observer,  searching  an  area  that  is  known 
to  contain  a  target,  looks  for  a  specified 
glimpse  time  (viz.,  1/3  sec)  in  the  direc¬ 
tion  of  the  target  with  his  foveal  vision. 

P^  is  a  function  of  the  ratio  of  an  accept¬ 
able  search  rate  to  that  demanded  of  a 
given  situation;  the  loosely  defined  con¬ 
cept  of  foveal  vision  is  replaced  by  that 
of  an  effective  glimpse  aperture. 

•  P2  is  the  probability  that  if  a 
target  is  viewed  foveally  for  one  glimpse 
period  it  will,  in  the  absence  of  noise 
(i.e.,  S/N  >>  1),  be  detected.  P2  is  de¬ 
termined  by  psychophysical  limits  operat¬ 
ing  on  the  observed  target  size  and  con¬ 
trast. 

•  P3  is  the  probability  that  if  a 
target  is  detected  it  will  be  recognized 
(again  during  a  single  glimpse  and  in  the 
absence  of  noise).  Recognition  is  usually 
(but  not  necessarily)  accomplished  on  the 
basis  of  intrinsic  shape  without  reliance 
on  context. 

We  then  write  for  Pr,  the  probability 
of  target  recognition, 

PR  =  PlxP2xV  (1) 

Inasmuch  as  the  three  steps  described 
above  are  independent  events,  P2  and  P3 
(as  defined)  each  representing  a  condi¬ 
tional  probability  under  the  one  preceding 
it,  the  product  formulation  of  Eq.  (1)  is 
obvious  and  rigorously  correct.  This  is 
so  despite  the  fact  that  the  individual 
terms  are  not  completely  independent  in  the 
sense  that  they  may  be  functions  of  some  of 
the  same  variables  (contrast,  for  example). 
This  and  certain  other  subtle  interactions 
are  discussed  briefly  in  a  later  section. 

In  the  following  paragraphs  the  nature  of 
each  of  the  three  terms  is  examined  in  some 
detail,  and  a  specific  analytical  expres¬ 
sion  is  developed  for  each  one. 

The  Search  Term 

The  first  term,  P3,  describes  the 
search  limitations.  When  searching  from  the 
air  for  a  terrestrial  object  whose  loca¬ 
tion  is  known  only  approximately,  it  be¬ 
comes  both  possible  and  necessary  to  util¬ 
ize  foveal  vision  and  to  search  fairly 
systematically.  The  maximum  acuity  of 
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foveal  vision  is  a  necessity,  since  there 
is  always  a  need  to  find  targets  at  the 
earliest  possible  moment  during  approach, 
and  at  long  range  either  the  apparent  size 
or  the  available  contrast  or  both  may  be 
marginal;  few  military  targets  really 
stand  out.  Foveal  vision  is  also  usually 
feasible,  since  only  a  limited  area  needs 
to  be  covered.  The  required  area  may  be 
as  much  as  the  whole  of  an  electro-opti¬ 
cal  display,  but  more  commonly  it  is  an 
area  set  by  navigation  errors  and  target 
location  uncertainties,  centered  on  a 
predicted  or  expected  target  location. 

Even  under  these  conditions,  however, 
search  rates  are  extremely  variable  and 
almost  intractable  for  the  fundamental 
reason  that  pieces  of  terrain  (not  to  men¬ 
tion  possible  targets)  differ  widely  and 
almost  defy  quantification.  Nevertheless, 
some  bounds  can  be  set. 

It  is  well  known  that  the  eye  moves 
in  discrete  steps,  ordinarily  with  about 
three  stops,  called  fixations,  per  second. 
(Actually  an  observer  occasionally  takes 
longer  to  examine  certain  points,  but  this 
does  not  affect  very  much  the  average  search 
rates  described  below.)  Our  approach,  there¬ 
fore  is  to  postulate  that  an  experienced  ob¬ 
server  searches  by  moving  an  apparent  aper¬ 
ture  (essentially  his  foveal  vision)  in  some 
fairly  regular  pattern  over  the  area  of  in¬ 
terest,  and  furthermore  that  he  adjusts  his 
average  interfixation  distance,  and  hence 
the  effective  size  of  his  scanning  aperture 
and  his  overall  search  rate,  in  accordance 
with  his  a  priori  information  on  the  size 
and  contrast  of  the  target  or  its  image.  In¬ 
tuitively,  one  recognizes  that  an  observer 
will  scan  the  floor  around  him  differently 
if  he  is  looking  for  a  pencil  or  an  ant. 
Stated  more  formally,  the  observer  estimates 
how  far  off  his  visual  axis  he  will  still 
have  an  adequate  probability  of  detecting 
the  expected  image,  and  he  automatically 
adjusts  his  search  rate  accordingly.  A  key 
concept,  therefore,  is  the  size  of  the  ef¬ 
fective  scanning  aperture — here  called  a 
glimpse  aperture,  Ag.  This  is  a  quantity 
that  commonly  ranges  from  10  to  100  times 
the  area  of  the  target,  a-j-,  but  can  some¬ 
times  vary  between  1  and  1000  times  a^.. 

The  reason  for  this  huge  spread  is  not 
just  the  observer’s  inability  to  predict 
the  nature  of  the  image  or  his  own  detec¬ 
tion  probabilities.  It  lies  also  in  a 


second  important  factor — the  structure, 
complexity,  or  "congestion”  of  the  sur¬ 
rounding  scene.  The  search  for  an  ant 
mentioned  above  will  also  be  quite  dif¬ 
ferent  depending  on  whether  the  floor  is 
covered  with  a  nearly  featureless  lino¬ 
leum  or  a  textured  and  patterned  rug. 
However,  this  "congestion"  cannot  be 
described  solely  by  the  two-dimensional 
spatial-frequency  content  in  a  scene. 

What  really  matters  is  the  density  of 
contrast  points — the  natural  fixation 
centers  for  the  eye — or  other  "confusion 
objects”  that  are  present  in  the  scene. 

The  writer  once  experienced  a  striking 
example  of  many  such  false  targets  (nat¬ 
ural  decoys,  as  it  were)  while  flying 
over  the  notorious  Coso  Range  in  Califor¬ 
nia.  This  region  contains  scattered  trees 
and  bushes  which  appear  very  dark  against 
the  background  of  sandy  soil  or  dried 
grass,  as  do  the  vehicles  and  "bridges" 
which  were  placed  in  the  area  as  "targets". 
Almost  every  tree  had  to  be  examined  to 
see  whether  or  not  it  had  straight  sides 
before  the  true  targets  could  be  found. 
Indeed,  tests  there  have  produced  some 
of  the  lowest  target  acquisition  proba¬ 
bilities  ever  measured. 

The  kind  of  adaptive  search  rate 
described  here,  in  which  the  observer 
automatically  reacts  to  both  the  charac¬ 
ter  of  the  scene  and  the  (anticipated) 
nature  of  the  target  imbedded  in  that 
scene,  has  been  advocated  informally  by 
this  writer  for  several  years.  Only  one 
independent  reference  to  such  a  concept 
was  found  in  the  literature  (Williams, 

1966) .  Williams  talks  about  target 
"conspicuity" ,  which  is  measured  by  the 
rate  at  which  a  particular  target  can  be 
successfully  searched  for  in  a  particu¬ 
lar  field,  and  he  points  out  that  the 
commonly  observed  lack  of  dependence  of 
target  acquisition  on  display  scale  fac¬ 
tor  (within  limits,  and  assuming  no 
change  in  information  content  on  the 
display)  is  another  manifestation  of 
observer  adaptation. 

A  heuristic  derivation  of  an  ex¬ 
pression  for  Pp  follows.  If  an  area  As 
is  to  be  searched,  the  number  of  glimpses 
(each  of  area  A~  =  leap)  required  to  cover 
the  area  is  As/Ag.  The  number  of  glimpses 
that  are  available  in  t  sec,  at  1/3  sec 
per  glimpse,  is  3t.  With  perfectly 
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systematic  search ,  the  probability  of  "look¬ 
ing  at"  the  target  (i.e.,  including  it  with¬ 
in  a  glimpse  aperture)  would  be  just  the 
ratio  of  the  available  glimpses  to  the  total 
number  required,  or  3t/(As/Ag).  This  would 
give  Pi  the  form  of  a  linear  ramp  function 
with  time.  Real  search  is  probably  some¬ 
thing  between  perfectly  systematic  and 
purely  random,  so  that  Pp  should  have  a 
form  that  lies  between  the  ramp  and  an  ex¬ 
ponential  rise.  We  conservatively  adopt 
the  latter  and  postulate 

-K  x  3t/(Ag/kaT) 


where  K  is  a  constant  and  k  is  a  parameter 
related  to  scene  congestion. 

The  supporting  experimental  evidence 
and  the  evaluation  of  the  constants  can  be 
found  in  the  original  reference  (Bailey, 
1970).  However,  one  more  idea  must  be  in¬ 
troduced.  Since  the  value  of  k,  the  number 
of  target  areas  in  a  glimpse  aperture,  is 
usually  found  to  lie  in  the  range  10  to  100, 
we  substitute  100/G  for  k,  where  G  is  a  meas¬ 
ure  of  the  congestion  in  a  scene.  Formally, 
it  is  the  number  of  fixation  centers  within 
the  nominal  glimpse  aperture  of  100  a^;  in 
practice,  it  is  nothing  more  than  an  esti¬ 
mate  by  the  observer  of  the  congestion  of 
a  particular  scene  relative  to  his  experi¬ 
ence,  and  he  is  asked  to  assign  a  number 
which  usually  falls  between  1  and  10.  Al¬ 
though  this  estimate  may  be  quite  crude, 
such  as  no  better  than  within  a  factor  of 
two,  it  is  believed  to  be  far  better  than 
ignoring  the  problem  altogether.  Further 
studies  are  needed  to  refine  this  trouble¬ 
some  point. 

Accordingly,  we  propose  the  following 
expression  for  Pp : 

-[ (700/G) (a  /A  )t] 

Px  -  1  -  e  18  .  (2) 

The  Contrast  Term 

The  second  term,  P2,  has  to  do  with 
the  basic  process  of  contrast  detection  by 
the  human  visual  system.  Blackwell’s 
classical  experiments  provide  the  funda¬ 
mental  data  here,  yielding  curves  of  thres¬ 
hold  contrast  (50  percent  detection  proba¬ 
bility)  versus  size  of  circular  discs  under 


various  levels  of  ambient  illumination. 
These  are  commonly  called  "demand”  con¬ 
trast  functions.  However,  there  is  a 
good  deal  of  evidence  that  the  best 
(i.e.,  lowest)  threshold  values  obtained 
by  Blackwell  must  be  adjusted  upward  sub¬ 
stantially  for  application  to  the  prac¬ 
tical  situations  discussed  in  this  paper. 
Ignoring  some  of  the  details,  which  can  be 
found  in  the  original  reference  (Bailey, 
1970),  it  is  proposed  that  the  shape  of 
the  "demand"  curve  of  threshold  contrast 
C*p  versus  angular  subtense  a  in  minutes 
of  arc  (min)  be  taken  from  the  average 
of  the  two  best-known  sources  of  1/3-sec 
data,  and  that  this  curve  be  adjusted  up¬ 
ward  by  a  factor  of  about  5.5  in  contrast — 
or  that  0.75  be  added  to  log  contrast. 
Since,  in  the  absence  of  bright  lights  or 
specular  glint,  target  contrasts  (using 
the  absolute  value  definition)  greater 
than  unity  are  rarely  observed  through 
the  real  atmosphere,  and  even  less  fre¬ 
quently  on  military  targets,  the  result¬ 
ing  curve  on  log-log  paper  can  be  approxi¬ 
mated  by  the  hyperbola 

(log  CT  +  a) (log  a  +  0.5)  =  1.  (3) 

This  simplification  is  often  convenient 
and  usually  adequate,  but,  whenever  con¬ 
trasts  greater  than  unity  are  important 
(for  example,  on  certain  electro-optical 
displays) ,  a  more  accurate  curve  with  an 
asymptotic  slope  of  -1/2  should  be  used. 

The  probability  of  detection,  P2, 
at  the  threshold  contrast  is,  by  defini¬ 
tion,  50  percent.  The  probability  of 
detection  for  other  values  of  observed 
contrast,  C,  has  been  shown  to  depend 
only  on  the  ratio  C/Cp  and  to  have  the 
form  of  the  cumulative  normal  distribu¬ 
tion  with  P2  “  0.9  for  C/Cp  «  1.5.  This 
is  equivalent  to  setting  the  value  of 
the  Gaussian  standard  deviation  equal  to 
0.39,  and  it  indicates  that  on  the  aver¬ 
age  Blackwell’s  subjects  chose  to  operate 
at  a  false-alarm  rate  of  about  1/200, 
corresponding  to  an  S/N  of  roughly  2.6:1. 
Accordingly,  we  write 

{ [ (c/ CT)-l]/o. 39}  2 

P„  =  -pr  /  e  U  du.  (4) 

V  2ir  -« 
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The  Resolution  Term 


The  third  term,  P3,  has  to  do  with 
the  more  subjective  act  of  deciding  what 
particular  image  forms  represent  in  the 
real  world.  But,  since  we  are  primarily 
concerned  with  shape  recognition  of  known  or 
briefed  objects  as  distinct  from  the  inter¬ 
pretation  of  unfamiliar  imagery,  the  prob¬ 
lem  can  be  reduced  to  the  visibility — or 
detectability  in  the  sense  of  the  previous 
section — of  sufficient  geometrical  detail 
for  shapes  to  be  compared  with  memory  and 
thereby  recognized.  The  concept  of  "suffi¬ 
cient"  detail  might  lead  one  into  the 
morass  of  "critical  details" — those  unique 
features  that  permit  various  classes  of 
objects  to  be  distinguished  from  one 
another.  However,  when  all  portions  of  an 
image  are  equally  detectable  so  that  the 
whole  shape  is  either  visible  or  not, 

Johnson  of  NVL  has  demonstrated  the  re¬ 
markable  fact  that,  for  a  variety  of  mili¬ 
tary  objects*,  a  single  parameter — namely 
Nr,  the  number  of  resolution  cells  con¬ 
tained  in  the  shortest  dimension  across  a 
target — is  all  that  is  required  to  describe 
what  constitutes  "sufficient"  detail  for 
detection  or  for  recognition.  He  found 
values  of  Nr  between  3.3  and  4.8,  or 
4.0  +  20  percent,  for  high-confidence 
recognition.  Other  experimenters  have  con¬ 
firmed  this  simplification  and  derived 
values  for  Nr  close  to  Johnson's  or  slight¬ 
ly  larger.  We  adopt  a  conservative  value 
and  write 

-[(N  /2)-l]2 

P3  -  1  -  e  r  Nr  >  2  (5) 

-  0  N  <  2 

r 

which  makes  P3  »  0.9  when  Nr  =  5. 

It  is  important  to  emphasize  the  mean¬ 
ing  of  Nr .  As  previously  defined,  it  is 
the  number  of  resolution  cells  contained 
in  the  minimum  dimension  (e.g. ,  width  or 
height)  of  the  projected  image  of  an  object 


One  should  probably  add  "in  a  mili¬ 
tary  context".  The  amount  of  detail  re¬ 
quired  to  distinguish  a  truck  from  an  ox¬ 
cart  is  far  less  than  that  required  to  dis¬ 
criminate  between  various  truck  models;  but 
the  simple  separation  of  objects  into 
classes  is  usually  sufficient  for  designat¬ 
ing  targets. 


to  be  recognized.  In  the  present  con¬ 
text,  "resolution  cells"  means  inde¬ 
pendently  detectable  spots — the  subject 
of  the  previous  discussion.  The  proper 
procedure  is  to  first  calculate,  from 
Eq.  (3),  the  size  of  the  smallest  spot 
that  can  be  seen — at  the  contrast  level 
with  which  the  target  is  presented  to  the 
observer.  Then,  the  number  of  these  de¬ 
tectable  spots  contained  in  the  shortest 
dimension  of  the  target  image  gives  the 
value  of  Nr. 

It  was  implied  above  that  recogni¬ 
tion  in  unfamiliar  situations  may  be  much 
more  complicated,  and  far  more  difficult 
to  predict,  than  the  mere  detection  of 
shape  details.  An  extreme  example  might 
be  the  classical  one  of  the  photo-inter¬ 
preters  searching  for  completely  unknown 
elements  of  the  Peenemunde  launching  areas 
during  World  War  II.  No  attempt  is  made 
to  extend  this  model  to  cover  such  cases. 
It  should  also  be  mentioned,  however,  that 
under  certain  other  circumstances  recogni¬ 
tion  may  be  very  much  easier  than  this 
model  would  predict.  Consider  the 
approach  of  unauthorized  aircraft,  or  the 
presence  of  vehicles  along  a  road  in  enemy 
territory.  Both  are  cases  in  which  the 
mere  detection  of  objects  might  be  suffi¬ 
cient  to  justify  the  decision,  "There  is 
a  target!".  These  cases  can  be  handled 
by  assigning  artifically  high  values  to 
P3  (when  the  prior  information  so  justi¬ 
fies)  ,  thus  effectively  equating  detec¬ 
tion  as  given  by  P2  to  recognition.  This 
point  is  discussed  further  below.  Our 
model  of  P3  covers  the  more  common  inter¬ 
mediate  cases  in  which  shape  provides  the 
primary  criterion  for  recognition. 

Discussion 

Equations  (2)  to  (3) ,  combined  as 
indicated  by  Eq.  (1),  constitute  the  pro¬ 
posed  model  of  a  human  observer.  The 
fundamental  concepts  and  the  basic  product 
formulation  are  explained  at  the  beginning 
of  this  paper.  The  result,  after  analyz¬ 
ing  each  of  the  three  terms,  is  an  expres¬ 
sion  for  the  probability  of  recognition  as 
a  function  of  several  observable  quanti¬ 
ties — the  apparent  size  and  contrast  of 
the  target,  and  the  required  search  rate 
and  the  false-target  density  in  the 
scene. 
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An  important  and  useful  property  of 
the  model  is  the  separation  of  variables 
that  has  been  achieved.  Each  of  the  terms 
is  expressed  as  a  function  of  a  rather 
small  number  of  input  parameters,  and  tar¬ 
get  size  and  contrast  are  the  only  parame¬ 
ters  which  enter  into  more  than  one  term. 
This  rather  significant  simplification 
arises  from  a  careful  consideration  of  the 
consequences  of  the  product  formulation 
and  a  detailed  evaluation  of  each  of  the 
terms  over  only  the  ranges  of  the  input 
variables  for  which  that  term  is  control¬ 
ling  or  otherwise  of  interest. 

For  example,  the  model  is  not  appli¬ 
cable  to  a  target  that  is  so  isolated  or 
whose  contrast  is  so  high  (relative  to  the 
background  clutter)  that  it  can  easily  be 
seen  with  peripheral  vision*,  since  in 
that  case  the  search  rate  can  be  very  much 
faster  than  postulated  in  Eq.  (2)  and 
will  still  be  very  high.  But  then  P2  will 
also  be  very  high  (essentially  unity) ,  and 
the  problem  is  almost  trivial.  The  search 
model  assumes  only  that  target  contrast  is 
not  that  high,  so  that  fairly  systematic 
and  fine-grained  search  must  be  carried 
out.  In  fact,  the  actual  search  rate  em¬ 
ployed  by  an  observer  is  determined  by 
some  sort  of  average  false-target  density 
over  the  scene.  If  the  actual  contrast  of 
a  specific  target  against  its  contiguous 
background  turns  out  to  be  less  than  suffi¬ 
cient  for  recognition  to  take  place  during 
a  single  properly-directed  glimpse,  this 
fact  will  show  up  in  P2  and  P3,  which  will 
correctly  reduce  the  value  of  PR. 

The  relationship  between  the  condi¬ 
tional  probabilities,  P2  and  P3,  can  be 
discussed  in  a  similar  way.  While  it  is 
clear  that  they  are  intimately  related, 
they  are  separated,  with  further  separa¬ 
tion  of  variables,  for  several  reasons. 
Ordinarily,  detection  not  only  precedes 
but  also  dominates  shape  recognition. 

That  is,  unless  P2  is  rather  high,  there 
is  probably  no  point  in  even  calculating 
P3,  since  Pr  will  be  too  low  to  justify 
the  sortie.  When  P2  is  high,  then  P3  con¬ 
trols.  On  the  other  hand,  as  has  been 
mentioned,  there  are  cases  in  which 


This  is  essentially  what  is  achieved 
by  multispectral  cueing  or  by  MTI  radar, 
as  indicated  on  page  8  of  the  original 
reference  (Bailey ,  1970). 


a  priori  or  contextual  information  may 
suffice  to  obviate  the  need  for  shape 
recognition  per  se.  In  such  cases — boats 
on  a  river  or  trucks  on  a  road,  for  ex¬ 
ample — P3  can  be  ignored  (i.e.,  set  to 
unity  without  regard  for  Eq.  (5))  and  P2 
will  control.  By  keeping  the  two  terms 
separate,  model  flexibility  is  preserved. 
Further  arguments  for  this  separation  re¬ 
volve  around  the  role  of  resolution. 

First,  as  a  practical  matter,  most  man¬ 
made  sensor  systems  are  resolution-limited, 
since  resolution  always  costs  something. 
(This  is  true  at  least  of  systems  whose 
displays  are  properly  designed.)  Accord¬ 
ingly,  the  sometimes-dif f icult  calculation 
of  system  MTF  need  be  applied  only  once 
(namely,  when  it  is  most  critical)  in  the 
shape-recognition  term.  More  importantly, 
there  are  many  cases  with  multiscaled  or 
zoom-capable  systems  in  which  the  combi¬ 
nation  of  a  priori  information  and  re¬ 
quired  search  area  may  make  a  two-step 
identification  of  the  target  desirable. 

In  such  cases  an  initial  and  tentative 
detection  on  a  wide  field  of  view  is 
followed  and  confirmed  (or  denied)  by 
shape  recognition  on  a  magnified  image. 

At  the  first  step  P2  controls,  but  PR  is 
incomplete;  at  the  second  step  P3  controls. 

The  product  of  the  three  terms  pro¬ 
vides  a  viable  model  for  a  wide  variety 
of  circumstances;  it  can  be  used  in  pre¬ 
dicting  the  capabilities  of  a  broad  class 
of  manned  systems,  since  it  deals  only 
with  the  observer  and  the  information  pre¬ 
sented  to  him,  whether  this  be  directly  to 
his  unaided  eyes  or  through  optical  aids, 
or  with  sophisticated  artificial  sensors 
provided  that  the  signal-to-noise  ratio 
is  high. 

APPLICATION  TO  FLIGHT  TOWARD  A  TARGET 

The  foregoing  basic  model  really 
applies  only  to  static  situations,  as  when 
looking  from  a  hovering  helicopter.  In 
normal  flight  approaching  a  target,  the 
time  constraints  enter  in  a  somewhat  differ¬ 
ent  manner. 

Firstly,  it  is  observed  that  the  pro¬ 
duct  P2P3  gives  a  single-glimpse  probabil¬ 
ity  of  recognition  as  a  function  of  the 
distance  from  the  target.  The  distance, 
together  with  the  flight  path  and  altitude 
of  course,  determine  the  proper  projection, 
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the  angular  subtense,  and  the  apparent  con¬ 
trast  (through  the  atmosphere)  of  the  tar¬ 
get.  Secondly,  repeated  glimpses  in  the 
same  direction  (i.e. ,  fixed  on  the  same 
contrast  point)  are  not  independent  in  the 
usual  probablistic  sense  and  do  not,  per 
se,  increase  Pr  significantly.  What  does 
change  with  repeated  looks  in  normal 
flight  is  that  the  observer  is  getting 
closer  to  the  target.  Thus  the  Pr  vs  D 
curve  with  P]_  «  1,  which  may  look  like  a 
cumulative  probability  curve,  really  is 
not  that;  it  is  simply  a  plot  of  the  in¬ 
creasing  probability  of  recognition  as  one 
approaches  a  target — on  the  assumption 
that  no  search  at  all  is  required. 

The  need  to  search  for  a  target  can 
only  delay  its  acquisition,  or,  saying  it 
differently,  a  given  probability  of  recog¬ 
nition  will  be  realized  at  some  shorter 
range.  To  determine  the  amount  the  origi¬ 
nal  curve  must  be  shifted  toward  later 
(i.e.,  smaller)  values  of  range,  one  must 
utilize  the  concept  of  searching  over  a 
specific  area,  Ag — such  as  a  SAM-site 
pattern,  or  the  area  within  a  revetment, 
or  the  area  enclosed  by  a  reticle  that  is 
adjusted  to  indicate  the  navigational 
error,  to  mention  a  few  examples.  If  the 
search  over  this  area  is  thoroughly  sys¬ 
tematic,  which  it  might  be  since  the  search 
is  completely  structured,  then  the  maximum 
delay,  T,  would  be  the  time  taken  to  search 
the  indicated  area  (given  by  As/3Ag  « 
As/3kax  =  AgG/300ax).  On  average  the 
delay  would  be  close  to  half  that  quantity, 
and  the  corresponding  range  shift  would 
then  be  Vt/2,  where  V  is  the  approach 
velocity.  (When  the  search  area  is  not 
well  defined,  as  in  locating  the  first  and 
grossest  checkpoint,  an  arbitrary  expres¬ 
sion  of  the  form  t  (sec)  =  14-  V(kn)/200 
has  been  used.) 

If  the  search  were  indeed  ideally  sys¬ 
tematic,  the  time  delays  would  lie  between 
zero  and  twice  the  average  value;  and  in¬ 
deed  this  kind  of  spread  in  field  test  data 
nust  be  anticipated.  With  a  search  pattern 
that  is  less  than  ideal,  the  Pr  vs  D  curve 
will  be  both  displaced  and  flattened,  with 
the  result  that  the  probabilities  calcu¬ 
lated  by  the  method  described  will  be  a 
Little  optimistic  at  the  high  side  (i.e., 
close  to  the  target). 


APPLICATION  TO  MOVING  TARGETS 

In  1970,  Rand  conducted  some  simple 
simulation  experiments  (Dugas,  1971; 
Petersen,  1971)  on  the  detection  of  sta¬ 
tionary  and  moving  targets.  The  data  con¬ 
sisted  of  the  times  taken  by  several  ob¬ 
servers  to  search  a  TV  screen  and  find  a 
simple,  electronically  generated,  rectangu¬ 
lar  target  against  a  background  that  was 
variously  a  piece  of  felt,  an  artificial 
grid  pattern,  or  scenes  from  aerial  pho¬ 
tography.  The  exponential  time-dependence 
was  confirmed,  and  the  effect  of  target 
motion  could  be  well  accounted  for  by  in¬ 
troducing  into  the  exponent  of  the  P^  term 
a  factor  (1  4*  0.45  V^) ,  where  V  is  the 
target  velocity  in  degrees/ second  sub¬ 
tended  at  the  observer* s  eye.  There  was 
some  indication  that  part  of  the  effect  in 
moving  objects  being  "easier**  to  detect  is 
due  to  the  changing  contrast  as  the  object 
passes  over  a  complex  background — analo¬ 
gous  to  the  detection  of  flashing  lights. 
However,  this  effect  could  not  be  quanti¬ 
tatively  separated  out  in  these  experiments. 

CONCLUDING  REMARKS  CONCERNING  NOISE 

The  foregoing  description  has  ignored 
noise  altogether.  Historically,  I  used  to 
think  that  this  model  could  be  equally 
applicable  to  an  observer  viewing  an  E-0 
display,  except  that  his  performance  would 
necessarily  be  degraded  by  the  insertion  of 
electrical  noise.  I  therefore  multiplied 
the  product  by  a  fourth  term  which 

was  given  by 

“[ (S/N)-l] 

n  =  1  -  e  S/N  >_  1 

=0  S/N  <  1  . 

This  form  has  been  used  at  Rand,  and  by 
other  people  too  (I*m  told),  to  make  per¬ 
formance  predictions  that  seem  to  be 
reasonable.  It  is  certainly  better  than 
nothing;  but  it  is  now  clear  that  the 
concept  of  "perceived  signal-to-noise" 
described  here  this  morning,  and  the  pro¬ 
cedures  that  were  given  for  applying  it, 
constitute  a  much  better  approach  to  under¬ 
standing  recognition  through  noise.  The 
procedures  may  appear  to  some  to  be  a 
little  arbitrary  or  ad  hoc-leh ,  but  they 
do  work.  Perhaps  a  more  fundamental  (or 
at  least  a  more  satisfying)  theoretical 
approach  can  be  evolved  building  on  Ory's 
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work,  mentioned  earlier  in  this  paper. 

Perhaps  someone  in  this  audience  will  do 

just  that  in  the  very  near  future. 
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ABSTRACT 

A  survey  and  critique  is  made  of  nine 
analytical  models  which  have  been  developed 
or  used  recently  to  describe  the  perform¬ 
ance  of  thermal  imaging  sensors  as  target 
acquisition  devices.  The  models  are  those 
developed  by  Bailey  (Rand) ,  Bailey/Martin 
(Martin  Marietta) ,  the  MARSAM  (Honeywell) , 
the  one  developed  by  NAFI,  the  classical 
model  of  Texas  Instruments , the  probabilistic 
model  of  Texas  Instruments,  the  FLAG  (Hughes) 
model,  the  one  developed  by  NVL  (Ft.  Belvior) , 
and  the  one  developed  by  EOS.  Mention  is 
also  made  of  current  modelling  efforts  at 
NADC  to  model  ship  classification  with 
FLIR  sensors. 

A  brief  discussion  is  then  given  of  the 
critical  physical  operations  and  functions 
associated  with  target  acquisition,  as  well 
as  the  more  primitive  phenomena  from  which 
these  operations  and  functions  are  struc¬ 
tured. 

The  critical  operations  are  defined  in 
terms  of  (presumably)  measurable  parameters. 
The  state-of-the-art  of  the  modelling  depends 
on  the  understanding  and  empirical  status  of 
these  parameters.  An  examination  of  this 
status  is  given. 

INTRODUCTION 

Attention  has  focussed  in  recent  years 
on  thermal  imaging  sensors  because  of  their 
impressive  operational  target  acquisition 
performance.  These  sensors,  usually  of  the 
forward-looking  infrared  (FLIR)  kind,  reached 
an  engineering  maturity  during  the  middle  and 
late  sixties.  The  engineering  and  operational 
evolution  have  progressed  at  a  rate  that 
easily  outdistanced  the  analytical  and 
phenomenological  understanding  of  these 


devices  with  respect  to  target  acquisi¬ 
tion  (an  excellent  historical  and  func¬ 
tional  treatment  of  FLIRs  is  given  in 
Lloyd ' s  Thermal  Imaging  Systems,  to  be 
published1) .  This  situation  has  led  to 
efforts  to  rigorously  understand  the 
sensor  parameters  which  influenced  tar¬ 
get  acquisition  and  to  standardize  their 
definition  and  specification  (Biberman's 
FLIR  Specification  Guide  2.) 

There  is  at  this  time  by  no  means  a 
universally  accepted  approach  to  FLIR 
target  acquisition  modelling.  This  will 
be  brought  out  by  this  paper,  which  will 
survey  several  models  which  have  appeared 
recently  in  the  literature  or  which  have 
been  used  recently  in  documents  describing 
relationships  between  sensor  parameters  and 
target  acquisition  performance. 

This  paper  will  also  describe  the 
scenic,  physical,  sensor,  and  pychophysical 
relationships  which  seem  to  impact  target 
acquisition  performance.  The  paper  will 
close  with  comments  on  the  state  of  under¬ 
standing  these  relationships,  and  descrip¬ 
tions  of  some  recent  work  towards  improving 
the  understanding. 


TYPICAL  MODELLING  APPROACHES 

Most  target  acquisition  models  have  the 
structure  of  a  cascade  or  nesting  of  con¬ 
ditional  probabilities.  The  various  con¬ 
ditional  probabilities  have,  in  general, 
different  content  and  meaning  in  the  vari¬ 
ous  models.  Some  of  the  conditional  pro¬ 
babilities  in  a  given  model  are  functions 
of  the  same  parameter,  so  that  it  has  been 
suggested  that  the  usual  product  form 
should  be  replaced  in  part  by  convolutions . ^ 
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The  individual  conditional  probabilities 
invariably  are  derived  from  an  analysis  of 
independent  experiments.4  To  date  target 
acquisition  measurements  have  not  been 
carried  out  which  allow  a  proper  evalua¬ 
tion  of  the  relative  importance  of  the 
phenomena  described  by  the  individual 
relations.  Because  of  this,  the  transi¬ 
tion  among  the  various  phenomena  in  the 
course  of  target  acquisition  is  not  well 
modelled,  hence  the  reliance  on  simple 
statements  such  as  nesting  of  conditional 
probabilities . 

Tables  I  through  III  describe  function¬ 
ally  the  MARSAM  II,  Bailey  and  Bailey /Martin 
models  of  target  acquisition.  The  tables 
illustrate  the  nesting  approach,  as  well  as 
the  different  content  in  the  elements  of  the 
nesting. 

Most  models  derive  the  probabilities  of 
target  detection,  recognition  and  identifica¬ 
tion  in  separate  computations.  Assumptions 
are  then  made  about  the  transition  from  detec¬ 
tion  to  recognition  (for  example)  in  order  to 
compute  system  performance  (with  auxiliary 
mission  data)/ such  as  probability  of  first 
pass  attack.  Appropriate  measurements  have 
not  been  carried  out  and/or  analyzed  to  allow 
the  correct  mission  characterization  of  the 
transition  from  detection  to  recognition  and 
identification. 

Some  of  the  models  are  designed  to  describe 
the  sensor  performance  in  the  sense  of  image 
quality,  having  in  mind  perhaps  the  require¬ 
ments  of  a  photo-interpreter  (e.g. ,  MARSAM  II) . 
These  models  tend  to  be  quite  analytical. 

Other  models  are  designed  to  associate  the 
sensor's  primary  specifications  with  real 
and  stylized  target  acquisition  (e.g. , Bailey/ 
Martin) .  These  models  tend  to  be  phenomenol¬ 
ogical.  Hybrid  (analytical/phenomenological) 
combination  also  exist  (in  fact,  most  models) . 

TARGET  ACQUISITION  MODEL  SURVEY 

Tables  IV  through  XII  depict  the  essential 
contents  of  nine  current  FLIR  target  acquisi¬ 
tion  models.  Two  other  modelling  efforts  are 
not  depicted  but  are  of  importance:  NADC's 
(Moser) ®  analysis  of  ship  classification  via 
FLIR  imagery  and  APG's  (Pibble) 7  modelling  of 
thermal  signatures  and  their  sampling  (for  IR 
seeker  optimization) . 


The  tables  indicate  that  most  models 
ignore  the  search  portion  of  target  ac¬ 
quisition.  One  of  the  models  (Bailey/ 
Martin)  has  included  search  as  an  effi¬ 
ciency  factor  in  which  the  human  scan 
tendencies,  the  scan  geometry  (imposed 
by  the  mission  and  geography,  and  pre¬ 
briefing)  ,  the  time  available  to  scan 
(  imposed  by  geometry,  V/h  and  FOV)  and 
the  scene  congestion  (number  of  potential 
false  targets)  determine  the  dwell  time 
on  the  true  targets.  All  models  which 
are  philosophically  similar  to  the  Bailey 
model  are  compatible  with  such  a  search 
term.  The  identification  of  clutter  with 
false  alarms  is  not  made  exact  in  any  of 
the  models . 

The  tables  also  show  that  several  of 
the  more  developed  models  are  derivatives 
of  the  Bailey  approach,  indicating  a  cer¬ 
tain  aesthetic  and  intuitive  appeal  of 
that  approach. 

Theoreticians  in  this  discipline  ex¬ 
hibit  extreme  differences  in  the  descrip¬ 
tion  of  the  detectability  and  resolution 
criteria  upon  which  their  modelling  is 
based.  Some  of  the  models  make  target 
detectability  a  function  of  satisfying 
the  Blackwell  criteria;  others  (notably 
NVL)  make  it  strictly  a  function  of 
signal  to  noise. 

With  respect  to  resolution  criteria, 
some  of  the  models  describe  pattern  recog¬ 
nition  as  a  certain  number  of  "resolvable 
spots"  for  a  given  level  of  task  difficulty 
(detection,  recognition,  identification) . 
Among  these  are  the  Bailey,  NVL,  NADC  models 
Other  models,  notably  the  Bailey/Martin, 
take  a  somewhat  pragmatic  approach  and 
associate  with  each  class  of  tasks  and 
targets  an  empirical  "bar  target  equiva¬ 
lent.”  Models  which  mix  the  resolvable 
spots  and  bar  target  equivalents  notions 
are  not  well  thought  out. 

The  resolvable  spots  approach  is 
analytically  akin  to  the  Schade  measures 
of  image  quality  such  as  N  ^ .  Thus , 
if  and  when  an  explicit  FlSr  model  is 
based  on  MTFA,  16  it  will  fall  in  this 
category. 
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TABLE  I 


MARS AM5 


Mathematical  Model  of  Target  Acquisition 


Basic  Definitions 


p  The  probability  of  having  a  line  of  sight  to  the  target 

los 

P*^  Conditional  probability  that  the  element  size  and  contrast  are 

sufficient  for  detection. 


P^*  Conditional  probability  that  a  displayed  element  of  sufficient 

size  and  contrast  will  be  fixated  by  an  observer  within  a  specified 
time . 


P^*  Conditional  probability  that  an  observer-viewed  element  of  sufficient 

size  and  contrast  will  be  correctly  discriminated  in  the  presence  of 
confusing  objects. 


d4 


P  ,  and 
d 


P 

r 


Conditional  probability  of  sufficient  signal  to  noise  for  detection. 

Computed  on  the  assumed  statistical  independence  of  P^  P^  and  P^ 

with  respect  to  each  other  and  with  respect  to  P  and  P  .  Effects 

d2  d  3 

of  confusing  objects  are  not  independent  of  time  and  the  search  mode 

P,_  is  expressed  as  a  function  of  P__. 
d2  d3 


Basic  equations: 


los  ° 


dl 


P  * (P  *) 
d2*  d3^c 


d4 


P 

r 


P  * 

r 


conditional  probability  of  target- 
element  recognition 


(Based  on  Defense  Research  Corporation  model  from  Stathacopoulos) 
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TABLE  II 

Bailey  Model  of  Target  Detection  Through  Visual  Recognition^ 

Basic  Definitions 

1.  p  is  the  probability  that  an  observer,  searching  an  area 

known  to  contain  a  target,  looks  for  a  specified  glimpse 
time  in  the  direction  of  the  target  with  his  foveal 
vision. 

2.  P2  is  the  probability  that  if  a  target  is  viewed  foveally 

for  one  glimpse  period  it  will,  in  the  absence  of  noise, 
be  detected. 

3.  P^  is  the  probability  that  if  a  target  is  detected  it  will 

be  recognized  (again  during  a  single  glimpse  and  in  the 
absence  of  noise) . 

4.  Quantity  n  is  an  overall  degradation  factor  arising  from  any 
noise  in  the  image  that  is  viewed  by  the  observer. 


BASIC  EQUATION 


PR  ■  Pl°  p2°  p3  •  " 
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TABLE  III 


Bailey/Martin  Marietta  Mathematical  .  lei  of  Target  Acquisition3 

Basic  Definitic  is 


=  The  conditional  probability  of  fixating  the  target  with  the 
operator's  glimpse  aperture  (fovea)  within  the  available 
search  time  which  is  determined  by  the  sensor's  FOV,  the 
mission  parameters  and  the  scene  clutter,  i.e.,  false 
target  density?  the  FOV  and  mission  parameters  determine 
the  rate  of  change  of  area  to  be  searched?  prebriefing 
and  scene  cues  are  introduced  in  the  geometric  definition 
of  the  search  (spot,  gross  or  detailed  linear,  gross  or 
detailed  area,  and  pseudo  random) ; 

=  The  conditional  probability  of  having  the  required  contrast/ 

target  size  relationship  to  mediate  target  detection  given 
that  the  target  is  in  the  operator's  glimpse  aperture?  if 
the  scene  statistics  (contrast  distribution  in  the  scene) 
is  known,  the  P^  as  defined  is  convolved  with  the  scene 
statistics  to  give  the  effective  P^ ; 

P^  =  The  conditional  probability  of  target  pattern  recognition 
reduced  to  a  bar  target  equivalent?  targets  and  tasks 
(detection,  recognition  and  identification)  are  ascribed 
different  bar  target  equivalents,  and  these  are  defined 
empirically;  bar  target  equivalents  are  not  defined  for 
some  targets  and  tasks  (e.g. ,  ship  classification) ? 

P^  =  The  conditional  probability  that  there  is  sufficient  (determined 

empirically)  displayed  signal  to  noise  ratio  (SNRD)  to  mediate 
the  bar  target  equivalent. 


The  basic  equations  are: 

PD  =  P1-VP3(D)’P4(D)' 

PR  =  VR>-VR)  » 

PI  =  P3(I)  P4(I)  ' 

for  detection,  recognition  and  identification,  respectively. 
The  symbols  D,R,I  represent  the  functional  description  of  the 
bar  target  equivalents  for  detection,  recognition,  identifica¬ 
tion.  Typically,  these  can  vary  with  the  type  of  sensor  used. 
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TABLE  IV 


A  Summary  of  the  Honeywell  Multiple  Airborne  Reconnaissance 
Sensory  Assessment  Model  (MARSAM  II)5 


1. 

TYPE 

Stored  Program,  Data  Bank 

2. 

PRIMARY 

PURPOSE 

Assess  Reconnaissance  Sensor 

Systems 

3. 

SENSOR 

TYPES 

Frame  Camera,  Panoramic  Camera, 

TV,  Visual  Observer,  Vertical  IR, 

FLIR,  SLAR,  FLR,  ELINT 

4. 

TERRAIN  AND 

FOLIAGE  MASKING 

As  submodel,  computes  probability 
of  having  a  line  of  sight  to  target 

5. 

HUMAN  FACTORS 

SUBMODEL  INPUT 

VARIABLES 

Target  element/background  display 
Contrast  Angular  Element  Size, 

Time  Duration  of  Scene,  #  of  Con¬ 
fusing  Objects,  SNRD 

6. 

BASIC 

STRUCTURE 

P  =  P  P  P  (P  )  P 

d  los°  dl°  d2(  d3)°  d4 

P  «  P_0P* 
r  d°  r 

7. 

SEARCH  MODEL 

Random  search  only,  5°  glance  at 
the  eye  (each  glance  independent) , 

Based  on  fixation  area,  computes 
expected  number  of  fixations. 

8. 

RECOGNITION 

SUBMODEL 

Based  on  Brainard's  scan  line 

requirements  data,  TV  and  IR  =  Nfc 

6  scan  lines  for  P  =.5. 

r 

9. 

ATMOSPHERIC 

MODEL 

Extensive  weather  model  (5  types  in 
store)  ,  uses  atmospheric  layers  to 
compute  transmittance,  apparent  path 
luminance  computed  as  a  function  of 
backs catter  and  ground  illumination. 

10. 

TARGET-ELEMENT 

DISTINCTIVENESS 

Computes  target-element  distinctive¬ 
ness  coefficient  which  accounts  for 
cuing. 

11. 

SENSOR  MODEL 

BASIS 

Limiting  resolution  of  sensor  projected 
on  ground. 

12. 

FLIR  TRACKING 

MODE 

Evaluates  effectiveness  of  tracking  or 
not  tracking  targets  to  enhance 
recognizability . 

13. 

CONTRAST  TERM 

P*  based  on  Blackwell  corrected  by 

.75  log  units  (as  recommended  by  Black- 

well) .  Based  on  Contrast  Ratio  (Apparent 
vs  contrast  threshold) .  Corrected  for 
brightness  of  background  less  than  10 
ft.  Lamberts. 
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TABLE  IV  (Cont) 


14.  P* 

r 

Uses  no.  of  visual  acuity  elements 
(for  direct  vision)  across  effective 
min.  target- element  dimension.  More 
than  3.2  required,  (arc  min.) 

15.  PLATFORM 

MOTION 

Computes  platform  motion  limiting  by 
computing  forward  motion  compensation 
in  TV  model,  image  smear  distance. 
Resultant  combined  effects  evaluated 
by  increasing  ground  resolution  distance 
D. 

16 .  SNR 

Based  on  Schade ' s  data,  empirical  fit 

1.4  for  P=.50,  P~.  90  at  about  3.5. 

Uses  Stathacopoulos  for  number  of  TV 

scan  lines  vs  P*. 

r 

17.  DISPLAY  \ 

MOVEMENT 

Minimum  threshold  of  P  for  display 
target  size  as  a  function  of  angular 
rate.  (TV  and  FLIR)  uses  Dugas  for 
threshold  function.  Exponential  in¬ 
creases  in  visual  acuity  threshold 
above  25  degr/sec.  Same  in  visual 
observer  model. 

18.  VISUAL  OBSERVER 

MODEL 

Computes  circular  search  area  offset 

from  flight  direction,  uses  minimum 

search  range,  maximum  search  range  to 

obtain  P.  and  A  for  far  middle  and 

los  eg 

near  points  for  averaging. 

19.  BACKGROUND  AND 

TARGET  REFLEC¬ 
TIVITY  (or 
equivalent  IR 
characteristic) 

Background  reflectivity  input  used  in 
confusing  objects  submodel  and  back¬ 
ground  brightness  computation.  Also, 
target- element  reflectivity  in  observer 
and  camera  models. 

20.  CAMOUFLAGE 

Camouflage  contrast  degradation  factor 
reduces  contrast  in  observer  TV  model 
and  camera  models. 

21.  TARGET  TYPES 

Area  and  linear  types  (radial  extent 
of  area  target) 

22.  DISPLAY  TYPE 

Stationary  moving  or  track  modes 
possible  in  human  factors  display 
submodel. 

23.  MULTI -SENSOR 

IMAGERY 

Hard-copy  only,  uses  weighting  co¬ 
efficients  for  4  spectral  regions 
for  each  of  D/R/I ,  4  spectral  regions 
applied  to  15  types  of  targets.  Uses 
Inter-Service  Multi-Sensor  Imagery 
and  Data  Storage  and  Retrieval  Systems 
data. 

24.  THERMAL  BASJS 

Power  received  from  target  and  back¬ 
ground  and  calculated  NEP. 
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TABLE  V 


A  Summary  of  the  Bailey/Martin  Model  of  Target  Detection 
Through  Visual  Recognition^ 


1 .  TYPE 

Calculational  (atmospheric  data  stored) . 

2.  PRIMARY 

PURPOSE 

To  describe  quantitatively  the  capabilities 
and  limitations  of  a  human  observer  with 
respect  to  target  acquisition  using  an 
imaging  sensor. 

3.  SENSOR 

TYPES 

CRT  displayed  TV,  RGTV  or  FLIR  imagery. 

4 .  TERRAIN  AND 

FOLIAGE  MASKING 

Explicitly  included  in  range  gated  TV 
(RGTV)  routine;  FOV  projection  (foot¬ 
print)  included  in  TV,  RGTV,  FLIR  com¬ 
puter  routines. 

5 .  HUMAN  FACTORS 

SUBMODEL  INPUT 

VARIABLES 

Psychophysical  model.  Search,  detection 
of  contrast,  recognition  of  shapes;  all 
conditional  probabilities  and  a  degrada¬ 
tion  factor,  based  on  SNRD. 

6.  BASIC 

STRUCTURE 

P,  .  *  P,*P OAP^ftP„  with 

d,r,i  1°  2°  3°  4, 

P,  =  P0  =  1.0  for  P  &  P. 

12  r  l 

7 .  SEARCH  MODEL 

Based  on  glimpse,  moving  aperture  (adjust¬ 
able  in  size)  for  search  model.  Ranges 
from  random  to  organized  search  depending 
on  mission  parameters. 

8.  RECOGNITION 

SUBMODEL 

Based  on  MAFLIR  data  (FLIR  model  only) 
and  NVL  data,  hence  phenomenological. 

9 .  ATMOSPHERIC 

MODEL 

Barhydt  (Hughes)  humidity  and  temperature 
algorithm  used  to  compute  equivalent  water 
per  1000  ft.?  transmittance  curves  entered 
to  determine  degradation  coefficient; 
altitude  and  slant  range  entered;  atmospheric 
layering  and  scattering  currently  not 
entered. 

10 .  TARGET-ELEMENT 

DISTINCTIVENESS 

Uses  search  type  to  change  number  of 
glimpses  required;  uses  clutter  to 
determine  search  time  available  per  FOV. 
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TABLE  V  (Cont) 


11.  SENSOR  MODEL 

BASIS 

Thermal  Signature  and  MRT  of  sensor,  i.e., 
based  on  Johnson  type  bar  target  equivalents 

12.  FLIR  TRACKING 

MODE 

No  capability  to  simulate  tracking  mode. 

13.  CONTRAST  TERM 

Based  on  Blackwell  corrected  according  to 
Davies.  Also  adds  .75  to  log  contrast. 
Computes  C/Ct  (PgC))=1.5. 

14.  P* 

r 

U.  S.  Army  NVL  and  MAFLIR  data  for  P^=.9; 
invokes  Johnson  notion  of  bar  target 
equivalent. 

15.  PLATFORM  MOTION 

None  (Motional  MTF  included  as  degradation 
to  sensor  MTF  and  MRT) . 

16.  SNR 

Separate  P^  Term;  uses  MRT  to  compute  SNRD 
for  bar  target  equivalent  (separate  computa¬ 
tion  for  detection,  recognition,  identifica¬ 
tion)  . 

17.  DISPLAY 

MOVEMENT 

None  (as  originally  formulated) ;  can  be 
included  as  a  Motional  MTF  term. 

18.  VISUAL  OBSERVER 

MODEL 

Built  into  all  conditional  probabilities 
(empirical  data) . 

19.  BACKGROUND  AND  TARGET 

REFLECTIVITY  (or 
equivalent  IR 
characteristic) 

Thermal  signature  inputted  where  available; 
otherwise  strong  dependence  on  MAFLIR 
measurements . 

20 .  CAMOUFLAGE 

Tacitly  included  in  clutter  and  thermal 
contrast. 

21 .  TARGET  TYPES 

None  (uses  critical  dimension  of  target 
and  bar  target  equivalent;  other  notions 
required  for  complex  and  maritime  targets) . 

22.  DISPLAY  TYPE 

None;  CRT  tacitly  assumed. 

23.  MULTI-SENSOR 

IMAGERY 

None;  sensor  models  are  independent. 

24.  THERMAL  BASIS 

MRT  and  target  and  background  thermal 
signature  (must  be  modified  for  maritime 
targets);  atmospheric  attenuation  included. 
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TABLE  VI 


NAFI  Imaging  Infrared  Systems  Math.  Model® 


1.  No  search  probability  term. 

2.  Calculates  NETD ,  NEFE ,  and  S/N  (theoretical) 

(NEFE  =  noise  equivalent  fractional  emissivity) . 

3.  Computes  angle  subtended  at  the  sensor  by  the 

target  (calculated  by  geometry  of  situation 
and  compares  its  value  with  the  angle  sub¬ 
tended  by  a  single  line  of  resolution) . 

4.  Does  not  consider  mission  parameters,  computes 

range-altitude  performance  envelopes. 

5.  Based  on  MTF  including  image  motion. 

6.  Criteria:  1. 5-2.5  lines  for  detection 

5-10  recognition 

10-16  designation  (identification  or 
classification) 

7.  Atmospheric  model:  (Townsend's  model) 3-5  Microns  only. 

8.  Non-probability  organized  program. 

9.  Major  Basis:  MTFs 

10.  Delta  T  of  10  degs .  assumed  (or  target) 

S/N  of  2  required  for  detection. 

11.  For  Scanning  Systems  only. 


TABLE  VII 

TI  Probabilistic  Model  of  the  Target  Acquisition 
Process  for  FLIR® 


1.  Atmospheric  model:  Wyatt,  Stull  and  Plass,  based 

on  molecular  absorption  by  water,  and  CO^  gas 
and  particle  scattering  by  haze  and/or  light 
fog  particles. 

2.  Based  on  S/N  and  MTF  theory. 

3.  Based  on  Bailey's  formulation. 

4.  Human  operator  model  is  based  on  the  product  of 

3  terms?  target  contrast  term  based  on 
Stathacopoulos  et  al  and  relates  p  of  seeing 
to  threshold  contrast  as  a  function  of  avg. 
target  angular  subtense  at  eye.  Resolution 
term  based  on  method  of  Stringham  for  detection 
and  Bailey  for  recognition.  Displayed  S/N  term 
based  on  Westinghouse  and  Hillman. 

5.  System  model  in  that  weapon  delivery  considerations 

are  included. 

6.  Modulated  with  AN/AAD-4  FLIR  flight  tests  (Ref.  22). 

7.  No  search  term  in  evidence;  found  search  time  in  a 

cluttered  background  difficult  to  predict. 
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TABLE  VIII 


A  Mathematical  Model  for  Predicting  the  Probability  of  Detecting  Target 
Against  Various  Background  Utilizing  a  FLIR  System.  (TI  Classical  Model)10 

1.  Compares  resolution  element  covering  target  with  a  resolution  element 

of  background  radiation. 

2.  Classical  model  uses  NEP,  NET,  S/N,  D*. 

3.  Only  considers  target  detection.  No  human  factors  element  and  display 

unit  information  included. 

4.  Single  channel  only. 

5.  Excludes  atmospheric  scattering  and  reflection.  Ozone  absorption  in 

atmosphere  ignored. 

6.  Atmospheric  transmission  based  on  Larmo re/McGee  and  Larmore/Passman. 

Absorption  effects  of  CO^*  and  H^O  considered. 

7.  Considers  the  oblique  viewing  effect. 

8.  Statistical  analysis  of  the  mean  square  noise  fluctuation  using 

Gaussian  distribution.  Transforms  probabilities  into  a  contrast 
scale  to  determine  target/background  relative  contrast. 


TABLE  IX 

Flag  (Forward-Looking  Air-to-Ground)  Model  of  Hughes11 

1.  Includes  human  vision,  IR  and  TV  sensors. 

2.  A/C  flight  at  constant  velocity,  altitude  and  offset  toward  the 

target;  turns  toward  target  if  there  is  a  navigational  offset, 
includes  time  delays  for  mission  geometry,  and  weapon  preparation 
time. 

3.  Uses  scanning  angle  limits,  probabilities  thresholds,  sensor 

selector  and  target  size. 

4.  Detection  in  terms  of  C/C^  relative  contrast. 

5.  Presented  area  of  target  computed  in  geometry  subroutine. 

6.  Subtense,  angle  (alpha)  computed  from  sensor,  viewing  parameters. 

7.  Computes  line  number  included  in  the  image  subtense  at  the  sensor 

input. 

8.  For  recognition,  uses  resolution  elements  across  target  required 

for  .50  recognition.  Normalized  to  alpha,  (target  subtense  on 
display) .  Uses  eyeball  resolution  as  a  function  of  adaptation 
level . 

9.  Atmospheric  and  weather  model  included  (considers  relative  humidity 

and  ambient  temperature  and  two  weather  states) . 

10.  Includes  emmissivity  of  target  and  background  and  uses  delta  T 
in  IR  model. 
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TABLE  X 


NVL  (Ft.  Belvoir)  Thermal  Imaging  Systems  Model 


12 


1.  TYPE:  Calculational 

2.  PURPOSE:  Input  to  meaningful  trade-off  analysis  to  help  select 

devices  to  perform  specific  military  tasks.  To  predict 
target  detection-recognition  ranges . 

3.  BASIS:  Signal  to  noise/resolution  model  based  on  decision  theoretical 

considerations.  (Based  on  NET  rather  than  SNRD) 

4.  BASIC  EQUATION:  A  triple  integration  (two  spatial  and  one  temporal) 

of  a  Fourier  transform  of  the  signal  after  atmospheric 
attenuation,  a  Fourier  transform  of  the  signal  after 
device  and  eyeball  filtering  and  a  "complex  function  of 
the  signal  and  system  parameters,"  divided  by  the  noise 
power  spectrum  at  the  decision  point.  Detection  pro¬ 
bability  is  determined  by  a  S/N  threshold  function. 

5.  HUMAN  FACTORS:  Enters  into  determination  of  the  required  S/N  ratio. 

6.  ATMOSPHERIC  MODEL:  Hughes  work  based. on  Deirmendjian. 

7.  HARDWARE  PERFORMANCE  BASIS:  System  blur  spread  function  (uses  measured 

device  MTF  when  possible) . 

8.  TARGET  DIMENSION  OF  SIGNIFICANCE:  Area  (rather  than  minimum  dimension) . 

9.  RECOGNITION  CRITERIA:  Function  of  the  number  of  effective  resolution 

areas  (blur  areas  or  resolvable  spots)  within  the  target. 

10.  NOISE  COMPUTATION:  From  detector  and  display  plus  noise  power 

spectral  density  of  eye. 

11.  MODEL  SHORTCOMINGS:  Cannot  handle  the  effects  of  filtered  noise.  Simple 

model  that  does  not  predict  field  performance  (does  not  in¬ 
clude  enough  operational  and  mission  factors) .  Needs  expan¬ 
sion  to  handle  camouflage,  non  uniform  effects,  motion  and 
vibration  effects.  No  search  model,  although  one  is  planned. 
No  provision  for  clutter. 

12.  MODEL  VALIDATION:  Validated  against  Warren  Grove  field  tests  (predicts 

fairly  well) . 


STRONGPOINTS :  Good  cost-effectiveness  and  cost  trade-off  analyses  are 

possible  utilizing  NET  as  a  basis.  Very  simple  and  straight¬ 
forward  mathematically.  Can  be  expanded  into  comprehensive 
model  by  addition  of  subroutines  for  terrain,  climate, 
weather,  etc.  Cheap  and  economical  to  implement  in  computer 
programming . 

The  NVL  MODEL  as  detailed  in  the  reference  appears  to  be 
very  promising  as  a  simplified  system  model  of  target 
acquisition  using  thermal  imaging  systems.  It  needs  further 
expansion  to  include  more  of  the  facts  of  thermal  imaging 
systems  in  target  acquisition  tasks.  As  noted  on  the  summary 
sheet,  these  shortcomings  are  in  the  area  of  better  utilization 
of  target  signature  data,  scene  clutter  searqh  and  cuing  factors. 
The  variable  set  at  present  is  believed  to  be  too  small  to  be 
useful  where  real-time  operational  systems  and  real  eyeballs 
are  involved.  SNRD  as  an  observable  is  not  used  as  a  definitive 
term  in  the  probability  determination.  It  is  not  clear  what 
decision  factors  are  entered  into  the  decision  function 
discussed  in  the  report. 10  What  is  the  decision  criteria 
function?  how  is  this  determined;  how  does  this  enter  into 
the  final  decision  whether  a  target  is  there  and  what  kind 
of  target  it  is  (recognition)  all  need  clarification.  It 
appears  to  take  care  of  the  recognition  task  adequately  but 
not  the  detection  task  although  the  former  is  believed  to 
follow  sequentially  in  a  decision  scheme. 
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TABLE  XI 


Bailey  Model  of  Target  Detection  Through  Visual  Recognition 


13 


BASIC  EQUATION 


Pl°P2°Vn 


In  applying  his  model  to  the  case  of  locating  vans  contained  in  a 
mobile  SAM  site  with  a  FLIR,  Bailey  makes  several  enlightening 
assumptions  and  simplifications  in  the  quantification  of  his  conditional 
probabilities  (see  Table  II) : 


2. 


3. 


P^  and  are  assumed  to  be  unity  since  the  effects 

of  search  are  incorporated  into  P^  and  the  display 
is  assumed  to  be  "ideal”, that  is,  only  signal  to 
noise  limited.  He  assumes  no  search  is  required 
because  the  target  is  easily  located  in  a  circum¬ 
scribed  area  (SAM  site)  and  because  the  display 
provides  sufficient  contrast,  scaling,  brightness 
of  display,  etc. 


exp 


r- 

f  L  . 

[  min 

I  ;  N  >0.4 

L 

1-4  a(Rs  +  V  .  td) 

1 

J  r  " 

1  -  exp 


*vv’] 


N  -  the  number  of  resolution  cells  contained  in  the  shortest 

r 

dimension  across  a  target 


where  L  .  is  the  minimum  dimension  of  the  target  (shortest 
projected? 

V  is  the  velocity  of  the  aircraft 
NET  is  the  Noise  Equivalent  Temperature 

t^  is  the  correction  factor  for  dynamic  viewing  condition 
(from  static  detection) 

Rg  is  the  range  of  the  sensor  from  the  target  (slant  range) 

AT  is  the  effective  temperature  difference  of  the  target  com¬ 
pared  to  the  scene  temperature 

a  is  the  absorption  coefficient  of  the  atmosphere, dependent 
mostly  on  water  content 

a  is  the  resolution  of  the  FLIR 


Note  that  Bailey  uses  the  criteria  of  number  of  resolvable  elements 
which  can  be  interpreted  as  resolvable  spots  or  bars  of  a  stylized 
(Johnson)  target  in  the  smallest  dimension  of  the  target  for  recogni¬ 
tion. 

Note  also  that  Bailey  uses  a  detector  channel  signal  to  noise  (NET) 
ratio  rather  than  a  SNRD  in  the  noise  degradation  factor. 

MODEL  STRONGPOINTS  -  AND  SHORTCOMINGS  -  The  Bailey  model  is  straight¬ 
forward,  empirical  in  basis  and  easy  to  comprehend.  Some  of  the 
computations  (like  resolvable  elements,  as  described  in  Reference  4) 
are  complex,  however.  It  also  takes  into  consideration  the  psycho¬ 
physics  of  target  acquisition.  The  applications  given  in  the  reference 
indicate  that  the  4  basic  components  of  the  model  are  not  meant  to  be 
used  all  the  time;  their  use  depends  upon  the  sensor  and  mission 
characteristics.  This  allows  ample  room  for  the  analyst  to  use  his 
judgment  as  to  the  limiting  factors  to  be  used  for  all  types  of  systems 
and  differing  operational  missions.  Note  also  that  in  giving  an 
example,  Bailey  sidesteps  having  to  include  search  parameters  (P^  &  p2 
are  assumed  to  be  unity) .  Note  also  that  he  assumes  an  ideal  display 
which  also  sidesteps  knotty  display  problems. 
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TABLE  XII 


EOS  Model  of  FLIR  Target  Acquisition14 


1.  Basis:  MTF  and  Noise  Equivalent  Temperature  (synthesized  into  SNRD) 
Predictions  are  based  on  Bar-space  Effective  Temperature 
Difference,  Display  average  brightness  and  System  AC  Gain. 


2.  Uses  the  concept  of  Resolvable  Temperature. 


3. 


4. 


Purpose  of  the  Model:  to  predict  the  outcome  of  an  IR  Sensor  Flight 
Test  program  using  bar-type  targets. 


Target  Acquisition  Model:  based  on  Bailey's  model.  Basic  target 
acquisition  equations: 


PD  =  P(L)  ‘  P (D/L) 

P-.  *  P(L)  *  P  (D/L)  °  P  (R/D) 
R 


P (L)  »  probability  of  looking  at  the  target 
P (D/L)  =  probability  of  detection  given  looking 
P ( R/D )  -  probability  of  recognition  given  detection 


This  model  does  not  address  search  since  in  the  test  validating  it 
the  operator  knew  both  where  to  look  and  what  he  is  looking  for. 
Note  that  this  is  a  conditional  probability  model.  It  does  not 
consider  mission  or  flight  parameters  directly.  This  model  starts 
with  the  target  on  the  display  .  PR  is  equated  to  bar-breakout  (the 
recognition  criteria)  utilizing  display  signal-to-noise  ratio  for 
determining 

P  =  Probability  (S  +  noise)  >  1.8 ‘sigma 
D/L 


AT  is  defined  as  the  temperature  difference  which  provides  a  S/N 
(Perceived)  =  1.8.  Uses  Johnson's  criteria  for  determining  the 
number  of  resolvable  elements  required  to  discriminate  the  shape 
of  the  target.  Uses  the  number  of  elements  on  the  target  minimum 
dimension  (a  la  Johnson) . 

P(R)  =  1  -  e  (N/4  .5)  n  «  #  of  elements  on  the 

target  minimum  dimension 


5.  Shortcoming  of  the  model:  This  model  was  especially  derived  to 
predict  the  results  of  a  test  program  having  very  restrictive 
techniques  and  objectives  and  hence  is  not  easily  expanded  or 
applied  to  other  test  programs,  operational  conditions  or  to 
other  types  of  targets. 

6.  Strong  points:  Considers  the  eye-brain  portion  of  the  target  acquisi¬ 
tion  system  and  includes  display  parameters  not  included  in  many 
models . 
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Theoreticians  who  use  the  bar  target 
equivalent  notion  express  their  reliance 
on  the  empirical  relations  first  formulated 
by  Johnson  1?  that  real  target  acquisition 
and  stylized  target  resolution  are  related 
(in  a  basically  not  well  understood  manner) . 
The  Bailey/Martin  formulation  falls  in  this 
category. 

Moser's  analytical  work6  in  the  classi¬ 
fication  of  ships  has  shown  that,  for  this 
class  of  tasks,  the  bar  target  equivalent 
notion  is  definitely  inapplicable  and  the 
resolvable  spot  notion  definitely  valid. 

This  kind  of  discrimination  between  notions 
has  not  been  derived  for  terrestrial,  com¬ 
pact  targets.  Some  pessimism  about  the 
generality  of  the  bar  target  equivalent 
notion  has  been  expressed  by  Rosell.1® 

Sufficient  confidence  has  been  expressed 
in  bar  target  equivalents  in  the  past  that 
one  of  the  primary  laboratory  measures  of 
FLIR  performance,  the  MRT,  is  based  on  the 
imaging  of  a  stylized  bar  target.19  Also, 
electro-optical  field  test  facilities,  such 
as  the  Navy's  at  Patuxent  River,  have  simi¬ 
lar  large  scale  stylized  bar  targets .  Models 
using  the  resolvable  spot  notion  should  pro¬ 
perly  develop  a  corresponding  sensitivity- 
resolution  relation  to  replace  the  MRT. 

The  appeal  of  the  bar  target  equivalent 
approach  is  that  FLIR  system  specification 
and  design,  laboratory  and  field  bar  break¬ 
out  tests,  and  real  target  acquisition  per¬ 
formance  can  be  readily  associated  with 
each  other. 

All  models  which  reference  the  target 
acquisition  performance  to  the  information 
presented  on  a  CRT  display  rely  on  the 
notion  of  the  SNRD.  As  basic  as  this  para¬ 
meter  is  to  the  modelling,  it  is  usually 
presented  as  a  derived  rather  than  a  meas¬ 
ured  quantity.  This  situation  is  unsatis¬ 
factory  for  a  discipline  attempting  to 
describe  observable  phenomena.  Some  efforts 

to  systematize  the  measurement  of  SNRD  are 

1  A  on 

being  developed  by  Synder.-1-0'^ 


DISCUSSION 

Target  acquisition  involves  the  trans¬ 
fer  of  information  from  a  scene  to  an 
observer.  The  transfer  is  effected  by 
the  sensor-display  combination.  The 
sensor's  information  transfer  to  the 
display  is  described  analytically  by 
sensitivity-resolution  relations  (and 
the  like),  and  the  observer's  apprecia¬ 
tion  of  the  information  is  described  by 
phenomenological  threshold  relations 
(such  as  the  Blackwell  criteria) . 

All  complete  models  of  target  acquisi¬ 
tion  must  eventually  be  reducible  to  the 
same  set  of  observable  or  measurable 
quantities  and  threshold  relations. 

Because  of  this,  the  discussion  of  any 
one  complete  model  is  sufficient  to 
identify  weaknesses  in  the  understanding 
of  the  phenomena  of  target  acquisition. 

The  Bailey/Martin  model  is  one  that 
has  attempted  to  be  complete ,  in  that 
search  through  pattern  recognition  is 
included.  Thresholds  of  the  eye,  dwell 
time,  contrast-magnification,  sampling- 
magnification  and  signal  to  noise  type 
relations  form  the  bases  for  the  func¬ 
tional  form  of  the  model.  The  structure 
of  the  model  is  described  in  Table  III 
and  the  physical  observables  and  thres¬ 
hold  relations  in  each  conditional  pro¬ 
bability  are  shown  in  Figure  1.  The  con¬ 
ditional  probabilities  are  designed  around 
the  observables  and  functions  which  seem 
to  be  of  primary  importance  in  FLIR  target 
acquisition. 

The  figure  shows,  underlined,  the  FLIR 
related  observables  and  threshold  relations 
which  need  to  be  better  understood,  meas¬ 
ured,  or  modelled.  A  natural  result  of 
improving  these  is  better  FLIR  target 
acquisition  modelling. 
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Probability 


(detection/recognition/identification) 


P 


1 


L  Atmospheric  transmittance ,  target  and  back¬ 
ground  thermal  signature,  resolution-sensitivity 
relation,  target  size. 


System  magnification,  contrast  displayed, 
Johnson  criteria  (detection/recognition/ 
identification),  system  resolution,  target 
size. 


Contrast  threshold ,  ambient  versus  display 
brightness,  displayed  target  size,  number 
of  display  gray  shades,  displayed  contrast, 
target  size. 


Search  type,  clutter ,  v/h,  FOV,  cueing  or 

- - - - — -prebriefing,  glimpse  rate  and  dwell  time, 

search  area. 

Figure  1.  Description  of  a  Typical  FLIR  target  acquisition  model  in  terms 
of  the  physical  observables  and  threshold  relations  which  govern  the 
phenomena  of  target  acquisition. 


RECENT  CONTRIBUTIONS 

This  section  describes  recent  contri¬ 
butions  which  impact  the  physical  observ¬ 
ables  and  threshold  relations  areas  in¬ 
fluencing  FLIR  target  acquisition  modelling 
as  described  in  the  previous  section  and 
Figure  1 . 

Clutter 

Clutter  can  be  described  as  elements  of 
the  scene  which  have  roughly  the  same  con¬ 
trast  and  dimensions  as  a  real  target  in 
the  scene.  The  importance  of  clutter  is 
mainly  in  the  search  function.  In  recent 
work, ^  subjects  were  asked  to  identify 
frames  from  AN/AAQ-5  filmed  imagery  which 
could  be  quantitatively  described  as 
"cluttered,”  "moderately  cluttered"  and 
"uncluttered"  (well  defined  target  in  an 
open  field) .  In  an  effort  to  find  a 


measure  or  discriminant  for  clutter, 
densitometer  traces  were  made  of  these 
qualitatively  characterized  scenes. 
Fourier  transforms  and  power  spectral 
densities  of  the  linescan  were  then 
made.  It  was  found  in  this  way  that 
the  power  spectral  density  (in  the 
spatial  frequency  domain)  is  a  dis¬ 
criminant  between  cluttered  and  un¬ 
cluttered  scenes.  This  is  illustrated 
in  Figures  2  and  3.  The  discrimination 
is  not  yet  quantized,  but  it  is  interest 
ing  to  note  that  this  discriminant  is 
defined  in  the  same  domain  that  is  used 
to  describe  the  resolution  and  resolu¬ 
tion-sensitivity  relations  of  a  sensor. 
It  is  hoped  that,  in  the  future,  system 
studies  of  the  resolution  requirements 
of  a  mission  will  include,  because  of 
this  discriminant,  the  impact  of  clutter 
imaged  by  the  FLIR  system. 
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Fourier  power  spectrum 


Spatial  frequency  (arbitrary  units) 

Figure  2.  Power  Spectrum  Plot  for  a  Typical  Cluttered  Scene 


140 


Fourier  power  spectrum 
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Scene  Statistics 


All  models  surveyed  to  date  which  included 
contrast  thresholds  also  use  a  deterministic 
apparent  or  displayed  contrast.  It  is  argued 
in  Reference  3  that  the  expressions  involving 
contrast  should  be  weighed  by  some  form  of 
statistics.  Targets  have,  in  a  given  scene, 
approximately  constant  emissivities  and 
temperature  distribution.  Thus,  the  target/ 
background  contrast  at  the  scene  really 
follows  the  scene  statistics  of  the  thermal 
background.  Similarly,  at  the  display,  the 
target/background  contrast  statistics  roughly 
follows  the  brightness  statistics  of  the 
scene.  Reference  3  has  compiled  typical 
scene  statistics  as  determined  from  filmed 
(display)  imagery  of  the  AN/AAQ-5  FLIR. 

This  kind  of  statistics  should  be  used  to 
form  a  weighting  function  for  the  dis¬ 
played  contrast  of  a  FLIR  system. 

Sampling-Magnification  Relations 

23 

Williams  has  performed  computer 
simulations  of  the  effect  of  FLIR- 
like  sampling  on  pattern  recognition. 

Using  a  variety  of  subjects,  he  has 
determined  that  (almost  independent 
of  the  number  of  target  pattern  samples) 
there  is  a  system  magnification  which 
gives  a  local  optimum  pattern  recogni¬ 
tion  performance.  The  optimum  is  "local'1 
because,  naturally,  the  absolute  per¬ 
formance  increases (in  his  tests)  monot- 
onically  with  sampling  number. 

These  effects  should  be  included  in  the 
FLIR  target  acquisition  model,  because  the 
system  magnification  represents  the  inter¬ 
play  among  sensor  FOV,  display  size  and 
viewing  distance. 

Detection  and  Identification  Sampling 

Requirements 

.  ?4 

Rugan^  has  reported  FLIR  airborne  target 
acquisition  measurement.  These  tests  can 
be  roughly  characterized  as  taking  place  in 
a  clutter  free  scene  and  in  a  high  signal 
to  noise  situation.  Thus  the  tests  approx¬ 
imately  test  the  pattern  recognition  per¬ 
formance  of  a  man-FLIR  combination.  Rugari 
has  found  that  the  results  can  be  modelled 
by  invoking,  as  a  stylized  target,  a  rec¬ 
tangle  which  circumscribes  the  target.  The 
pattern  recognition  requirement  then  is 


related  to  a  minimal  sampling  of 
this  area  or  of  the  geometric  mean 
of  its  sides.  Analyses  have  not 
yet  been  performed  to  see  whether 
this  measure  of  sampling  requirements 
is  in  agreement  with  the  bar  target 
equivalent  and/or  the  resolvable  spots 
measures. 

FLIR  Target  Acquisition  Measurement 

and  Analyses 

EOS  and  WPAFB  have  designed  and 
conducted  14,25  an  extensive  and 
rigorous  set  of  FLIR  target  acquisi¬ 
tion  measurements,  using  the  Navy’s 
Patuxent  River  E-0  Test  Facility. 

The  tests  included  measurements  of 
bar  breakout  and  truck  detection  and 
recognition  ranges.  Backgrounds  were 
varied  and  atmospheric  temperature  and 

humiditv  determinations  were  made.  To 
maximize  the  experimental  samples  in 

each  set  of  the  test  program,  flight 
profiles  were  made  essentially  constant. 

EOS  has  performed  a  variable  analysis  . 
on  the  flight  test  measurements;2^  further) 
analyses  are  in  progress. 

An  alternate  approach  to  the  data 
analysis  was  presented  by  Mendez. 2^ 

The  approach  is  based  on  the  assump¬ 
tion  that  P^  (see  Table  III)  represents 
the  effect  of  SNRD  on  target  acquisition 
and  the  hypothesis  that  the  SNRD  is  the 
dominant  factor  in  target  acquisition. 

Then  the  MAFLIR  flight  test  results  are 
reduced  to  clear  atmosphere  for  inter¬ 
pretation  according  to  the  following 
procedure : 


1. 


^  *  p>|(r  t  )/  where 
4  meas  4  calc 


r  measured  real  atmosphere 
meas  c 

detection  or  recognition  range , 


K=const. ;  a  (r) =atmosphere  trans¬ 
mittance  at  range  r 
At= thermal  signature 

rcalc=Ca^CU^ateC^  c-*-ear  atmosphere 
detection  or  recognition  range 
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KRT (r)=resolution-sensitivity  relation 
expressed  as  a  function  of  range  rather 
than  spatial  frequency. 

3.  For  a  given  target  and  background, 
1.)  and  2.)  reduce  to 


TalrmeasJ  _  1  . 

m5t7  r  V  MRT/ r  n 

I  measl  I  calcl 

In  this  approach,  it  is  seen  that  the 
FRT  is  interpreted  as  a  transfer  curve 
which  can  be  used  to  reduce  the  flight 
test  measurements  to  clear  atmosphere. 

Then  the  reduced  data  can  be  interpreted 
in  the  sense  of  Johnson  criteria  (for 
example) .  The  analysis  procedure  is 
depicted  graphically  in  Figure  4.  To 
estimate  t  (r)  from  the  temperature  and 
relative  humidity  measurement  in  the 
MAFLIR  program,  a  model  atmosphere  due 
to  Barhydr  was  used. 

The  approach  described  above  makes 
it  essential  to  fix  and  to  know  the  MRT 
cr  RT  of  the  system  (preferably  measured 
during  the  flight  test) . 

The  MAFLIR  flight  tests  are  difficult 
to  interpret  in  a  rigorous  sense  because 
the  MRT  of  the  system  was  altered  in  the 
course  of  the  flight  tests,  and  the  experi¬ 
mental  design  made  the  RT  one  of  the 
variables  in  the  course  of  the  flight 
test  program.  (Hopefully,  future  FLIR 
target  acquisition  measurement  programs 
will  exercise  care  in  this  regard) . 


Nonetheless,  analyses  performed  to  date 
on  these  data  indicate  that  the  results 
can  be  interpreted  in  the  sense  of  the 
bar  target  equivalent. 

The  algorithm  described  in  this  section 
is  a  first  order  approximation  because  it 
ignores  angular  subtense  dependent  thres¬ 
hold  relations.  Corrections  of  this  type 
can  be  added  by  including  a  P  -like  term 
(see  Table  III)  in  the  algorithm.  This 
has  not  yet  been  done.  More  controlled 
measurements  of  the  MAFLIR  type  are 
required. 

CONCLUSIONS 

A  survey  of  FLIR  target  acquisition 
models  has  been  presented.  These  range 
in  approach  from  analytical  to  phenomenol¬ 
ogical.  Models  of  the  "rule  of  thumb" 

2ft 

type  were  not  included. 

Most  models  are  synthesized  from  analyses 
and  intepretations  of  independent  experi¬ 
ments  performed  to  isolate  the  effect  of  a 
specific  set  of  physical  observables.  The 
models  do  not  exhibit  agreement  with  respect 
to  the  interpretations  or  influences  of 
pattern  recognition  and  S/N  (video  or 
display) . 

The  status  and  rigor  of  the  modelling  is 
dependent  on  test  and  analyses  programs  of 
the  kind  represented  by  the  MAFLIR  related 
efforts.  Much  more  data  of  the  MAFLIR  kind, 
with  better  experimental  design,  is  required 
in  order  to  define  an  unequivocal  FLIR  target 
acquisition  model  which  is  valid  from  search 
through  the  higher  tasks  of  target  acquisi¬ 
tion. 


Figure  4.  Graphical  description  of  the  use  of  the  MRT  (or  RT)  as 
a  transfer  curve  to  reduce  real  atmosphere  target  acquisition  range 
measurements  to  clear  atmosphere. 
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ABSTRACT 


Results  of  studies  from  the  published 
Literature  were  used  as  data  for  an 
analysis  of  tasks  involving  search  for 
simple  targets.  Absolute  values  were 
plotted  across  studies  on  a  series  of 
common  graphs.  The  relationships  between 
the  dependent  measure  (search  time)  and 
various  independent  variables  were 
examined  for  systematic  trends.  The 
-analysis  yielded  a  number  of  findings 
Lncluding  estimates  of  maximum  scann¬ 
ing  rate  and  information  processing 
rates.  Since  these  data  were  based 
on  search  for  simple,  well-learned 
target  symbols,  they  are  suggested 
as  providing  an  estimate  of  the  lower 
Limits  of  search  performance  which  can 
be  expected. 

INTRODUCTION 

There  appear  to  be  three  kinds  of 
/isual  search  situations  that  have 
been  studied  experimentally.  One  is 
characterized  by  temporal  uncertainty. 

The  searcher  knows  where  the  target 
will  appear,  but  he  does  not  know  when. 
Generally,  when  it  does  appear,  the 
target  is  of  short  duration  and  easily 
missed  if  the  locus  of  search  at  that 
moment  is  elsewhere.  The  appropriate 
measure  of  performance  in  this  case  is 
the  frequency  or  proportion  of  targets 
detected.  Research  concerned  with  this 
problem  has  concentrated  on  the  effects 


of  target  characteristics  as  they 
influence  detectability,  e.g.  con¬ 
trast,  size  etc.,  and  with  the  ques¬ 
tion  of  decrements  in  target  acquisi¬ 
tion  as  the  search  period  is  prolonged 
(vigilance).  We  have  recently  consid¬ 
ered  aspects  of  both  elsewhere  (Teich¬ 
ner  and  Krebs,  1972,  In  press;  Teich- 
ner,  1972). 

A  second  kind  of  visual  searching 
is  characterized  by  positional  uncer¬ 
tainty.  In  this  situation  the  tar¬ 
get  is  known  to  be  present  and  it 
usually  remains  so  until  detected. 

What  is  unknown  is  the  the  location 
of  the  target.  The  appropriate 
measure  of  performance  is  search 
time,  i.e.,  the  time  required  to 
find  the  target.  Of  course,  targets 
may  have  both  positional  and  temporal 
uncertainty  and  these  more  complex 
situations  comprise  the  third  kind 
of  search  situation  that  has  been 
studied.  All  three  experimental 
paradigms  are  also  representative 
of  realistic  situations. 

In  studies  involving  temporal 
uncertainty,  i.e.,  Type  I  and  Type 
III,  both  search  time  as  well  as  the 
proportion  of  targets  detected  have 
been  used  as  a  dependent  measure.  It 
is  important  to  distinguish  between 
search  time  as  used  in  those  studies 
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and  search  time  as  used  in  the  Type  II 
case  which  involves  only  positional  uncer¬ 
tainty.  When  targets  are  missed,  search 
time  reflects  only  those  searches  which 
were  successful.  Instances  of  missed 
targets  contribute  to  the  calculated 
proportion  of  detections,  but  not  to  the 
average  search  time.  Thus,  comparisons  of 
search  times  may  be,  and  usually  are,  not 
possible  under  conditions  of  equal  target 
acquisition.  On  the  other  hand,  in  the 
Type  II  case  where  all  targets  are  detec¬ 
ted,  from  one  comparison  condition  to 
another  search  time  represents  the  same 
quality  of  target  acquisition  performance. 

It  would  seem  that  somewhat  different 
processes  are  implied  by  the  two  kinds 
of  search  time.  Search  time  obtained 
from  a  study  having  only  temporal  uncer¬ 
tainty,  at  least  in  part,  involves  a 
process  which  leads  to  decisions  of  wheth¬ 
er  or  not  searching  should  be  continued, 
whereas  search  time  obtained  from  situa¬ 
tions  having  only  positional  uncertainty 
would  appear  to  be  more  purely  represen¬ 
tative  of  the  search  process  as  such. 

Both  factors,  those  which  determine  the 
continuance  of  searching  and  thos2 which 
determine  the  nature  of  searching,  are 
critical  to  an  understanding  of  target 
acquisition  in  all  three  situations. 

This  paper  is  concerned  with  search 
time  in  situations  having  only  positional 
uncertainty.  The  experimental  question, 
therefore,  is  that  of  how  long  it  takes 
to  find  a  target  and  not  that  of  whether 
the  target  will  be  found.  In  the  last 
few  years  there  has  been  a  considerable 
increase  of  interest  in  the  problem 
beyond  the  context  of  target  acquisition, 
and  with  respect  to  the  nature  of  human 
information  processing.  Experimental 
investigations  have  been  directed  toward 
such  issues  as  serial  vs.  parallel  data 
handling,  feature  testing  models,  scanning 
models,  and  toward  the  nature  of  attention 
and  short-term  memory.  Methodologically, 
the  experiments  require  searching  for  a 
target  or  datum  in  an  array  of  non-target 
stimuli.  They  differ  from  the  convention¬ 
al  target  acquisition  study  in  that  they 
de-emphasize  properties  of  the  stimulus 
and  emphasize  characteristics  of  the 
searcher. 


Positional  uncertainty  studies 
may  be  viewed  in  still  another  context, 
the  choice  reaction  time.  More  speci¬ 
fically,  they  may  be  seen  as  special 
cases  of  that  type  of  choice  reaction 
study  in  which  the  individual  is  re¬ 
quired  to  respond  to  selected  stimuli 
and  required  not  to  respond  to  any 
others.  At  this  time  we  wish  only  to 
emphasize  as  strongly  as  possible  that 
no  matter  how  it  is  conceived,  visual 
target  acquisition  involves  both 
information  processing  and  visual 
factors.  Our  effort  was  aimed,  at 
the  information  processing  aspect  of 
the  problem.  Specifically,  we  were 
concerned  with  the  question  of  whether 
available  results  are  sufficient  to 
provide  relevant  quantitative  search 
time  functions  for  Type  II,  the  posi¬ 
tional  uncertainty  case. 

METHOD 

As  with  others  of  our  recent 
studies,  it  was  our  hope  to  employ 
the  absolute  measures  available  in 
the  literature  as  our  data.  By  mani¬ 
pulation  of  those  data  we  wished  to 
determine  if  there  is  enough  consis¬ 
tency  within  the  literature  to  permit 
fitting  functions  which  hold  across 
the  data  of  different  studies.  The 
data  analysis  starts  with  the  assump¬ 
tion  that  only  one  independent  variable 
determines  most  of  the  variation  of 
the  dependent  measure.  A  failure 
to  support  that  assumption  forces  the 
further  assumption  of  two  independent 
variables  as  primary.  How  far  one 
goes  in  this  direction  depends  upon 
what  is  available  in  the  literature 
and  how  ingenious  one  is  in  manipulat¬ 
ing  it.  It  also  depends  upon  success 
in  establishing  a  literature-derived 
data  base  which  contains  the  relevant 
results  of  quality  investigations. 
Finally,  what  is  relevant  is  deter¬ 
mined  by  the  restrictions  of  the 
investigation.  For  this  study,  the 
following  constraints  were  imposed 
upon  the  selection  of  the  experimental 
studies  to  be  used: 

\ 

1.  The  stimuli  used  had  to  be  of 
an  intensity  such  that  they 
were  easily  detectible  and 
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identifiable  as  individual  stim¬ 
ulus  elements. 

2.  The  stimuli  had  to  be  simple  and 
familiar  elements  such  as  letters, 
colors,  and  geometric  shapes 
which  might  be  expected  to  be 
known  to  subjects  without  specific 
learning.  For  alphanumeric 
stimuli  only  studies  using  single 
digits  or  letters  were  employed. 

3*  Studies  involving  masking, 

distorting,  or  any  other  form  of 
noise  were  not  used.  However, 
studies  using  irrelevant  or  non¬ 
target,  simple  stimuli  were  used 
if  those  stimuli  were  part  of 
the  searched  array. 

4.  Subjects  had  to  be  reasonably 
well-practiced  at  the  task.  In 
general,  this  was  determined  by 
the  studies  themselves  since  the 
amount  of  practice  given  prior 
to  obtaining  the  reported  data 
was  highly  variable.  In  two 
cases  (to  be  discussed)  the  data 
were  taken  from  the  middle  portion 
of  experiments  having  unusally  ex¬ 
tensive  practice  in  order  to  keep 
the  learning  factor  approximately 
constant. 

5*  The  amount  of  error  of  all  kinds 
(omissions  and  commissions)  had 
to  be  no  more  than  five  per  cent. 
This  criterion  interacted  with 
and  partly  determined  what  was 
considered  to  be  "reasonable" 
practice.  One  minor  exception 
was  made  in  this  regard  and  that, 
too,  will  be  considered  later.  * 

6.  From  the  subject*s  point  of  view 
the  possible  locations  of  the 
target  had  to  be  well-defined  and 
known.  This  constraint  precluded 
studies  of  empty  fields. 

7*  All  elements  of  the  display, 

targets  and  non-targets,  had  to 
be  present  from  the  onset  of 
viewing  to  the  occurence  of  a 
response  indicating  detection. 


RESULTS 

The  reported  data  of  most  studies 
were  in  the  form  of  the  total  time 
required  to  find  the  target.  However, 
studies  using  more  than  one  target 
per  display  reported  the  time  to 
detect  all  of  the  displayed  targets. 

As  it  turned  out,  we  were  unable  to 
handle  those  studies  in  the  same  con¬ 
text  as  the  others.  Furthermore, 
those  were  too  few,  with  too  much 
interstudy  variability,  to  warrant 
a  separate  analysis.  Therefore,  stud¬ 
ies  displaying  more  than  one  target 
at  a  time  were  discarded. 

Some  studies  displaying  only  one 
target,  reported  the  average  search 
time  per  subgroup  of  stimuli  in  an 
array.  For  example,  Neisser  (1963) 
and  subsequent  studies  based  upon 
his  findings  (e.g.  Neisser,  Novick 
and  Lazar,  1963;  Cohen  and  Pew,  1970) 
reported  the  time  per  row  in  a  matrix 
of  alphanumeric  symbols.  Their  data 
were  converted  to  total  search  time, 
i.e.  the  time  required  from  the  start 
of  a  trial  until  the  target  was  report¬ 
ed  and  to  the  time  per  stimulus  or 
single  symbol  of  the  display. 

It  was  also  considered  necessary 
to  modify  the  description  of  the  number 
of  stimuli  per  array  reported  by  all 
studies.  All  authors  reported  the 
average  search  time  over  a  set  of 
displays  in  which  the  position  of  the 
target  was  experimentally  balanced 
within  the  set.  For  example,  an 
experiment  using  an  array  containing 
50  stimuli  (49  non-targets  and  one 
target)  would  have  balanced  the  posi¬ 
tion  of  the  target  over  the  set  of 
displays.  The  mean  search  time  of 
the  set,  then,  represents  a  mean  posi¬ 
tion  location.  The  mean  search  time 
in  this  example  is  the  search  time  for 
one-half  of  the  number  of  elements  in 
the  array,  i.e.  25*  Therefore,  the 
data  to  be  presented  which  deal  with 
the  number  of  stimuli  in  the  array  will 
describe  this  variable  as  the  "half 
number  of  stimuli" 

With  the  restrictions  listed 
above,  a  systematic  search  of  the 
abstracted  literature  between  i960  and 
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1971*  plus  references  from  bibliographies, 
reviews  and  individual  papers  yielded  70 
studies  of  which  12  provided  Ik  experiments 
having  acceptable  data.  All  of  those 
studies  required  visual  scanning  of  an 
array  which  was  at  a  distance  from  the 
subject.  Paper-pencil,  letter-cancelling 
tasks  were  not  accepted  because  the  search 
time  measures  were  assumed  to  be  inflated 
by  measurement  with  a  stop-watch  and  by 
the  manual  aspect  of  the  task. 

We  started  by  assuming  that  the  only 
important  variable  was  the  number  of 
stimulus  elements  in  the  array.  As  will 
be  seen  we  also  considered  the  size  of 
the  source  from  which  the  targets  were 
selected  (number  of  possible  different 
targets),  the  size  of  the  visual  field, 
the  nature  of  the  stimulus  elements,  and 
the  amount  of  stimulus  information. 

Figure  1  presents  the  total  search 
time  required  to  find  a  single  target 
when  only  one  kind  of  target  is  possible 
and  when  that  target  appears  only  once 
in  the  array.  The  results  of  eight  differ¬ 
ent  studies  reporting  11  experiments  are 
shown  on  a  semi-log  plot  for  which  the 
abscissa  is  one-half  of  the  number  of 
stimuli  actually  present  in  the  array. 

The  four  lines,  drawn  by  eye,  are  intended 
only  to  provide  a  clearer  picture  of  the 
trends.  The  study  by  Lehtio  (1970)  differs 
from  all  of  the  other  studies.  In  that 
study  all  stimuli  including  the  target 
were  digits  and  all  varied  in  color  and 
size.  When  the  target  was  a  particular 
color,  it  was  the  only  instance  of  that 
color.  When  it  was  a  digit  it  was  the 
numeral  3*  and  the  only  3  in  the  array, 
etc.  Although  the  remaining  studies  in 
the  figure  are  similar  in  that  they  are 
all  cases  of  the  target*s  differing  or 
being  selected  from  other  similar  stimuli 
on  the  basis  of  a  single  attribute,  that 
attribute  was  the  only  difference  between 
the  target  and  the  non- targets.  Lehtio*s 
targets  differed  on  two  other  dimensions 
as  well. 

Figure  1  represents  a  variety  of  spe¬ 
cific  kinds  of  target  and  non-target 
stimuli.  Ignoring  the  results  of  Lehtio, 
the  figure  suggests  that  the  shortest 
search  times  were  associated  with  finding 
a  target  circle  which  was  larger  than  the 
remaining  circles  of  the  display.  The 


next  most  rapid  detection  occurred 
when  the  target  was  a  red  circle 
among  green  circles  of  the  same  size 
for  arrays  having  half  numbers  of 
stimuli  greater  than  100.  The  next 
longer  search  time  was  associated 
with  finding  a  triangle  in  an  array 
of  circles,  and  the  slowest  search 
with  finding  a  square  among  an  array 
of  circles. 

The  difference  between  the  trends 
for  a  square  target  and  a  triangular 
target  when  each  is  embedded  in  an 
array  of  circles  may  be  due  impor¬ 
tantly  to  the  differences  between 
investigative  methods.  In  spite  of 
that  possibility,  it  is  worth  noting 
that  a  square  is  more  similar  to  a 
circle  than  is  a  triangle  and,  there¬ 
fore,  that  the  square  represents  a 
target  which  should  be  harder  to  dis¬ 
criminate  from  the  rest  of  the  array. 

The  curve  drawn  through  the 
triangular  target  data  of  Brody, 

Corbin  and  Volkmann  (i960)  and  Volk- 
man  et  al  (1964)  also  fit  the  search 
times  obtained  from  arrays  using  alpha¬ 
numeric  data.  The  one  large  exception 
is  the  study  by  Cohen  and  Pew  (1970). 
That  study  may  be  compared  directly 
with  the  one  by  Neisser,  Novick  and 
Lazar  ( 1963)  since  it  was  an  exact 
replicate  in  all  respects  except 
that  the  subjects  were  instructed  to 
emphasize  accuracy  in  finding  the 
target  whereas  Neisser,  Novick  and 
Lazar's  subjects  and  those  of  the 
other  studies  shown  were  given  speed 
as  the  instructional  emphasis. 

The  search  time  measures  were 
usually  reported  in  milliseconds. 
However,  in  order  to  reduce  varia¬ 
bility,  Figure  1  and  subsequent  uses 
of  search  time  as  a  dependent  measure 
plots  the  data  to  the  nearest  .01 
second. 

In  general  Figure  1  suggests 
that  at  first  search  time  is  almost 
unaffected  as  the  number  of  stimuli 
in  the  array  increases,  but  that  after 
some  critical  number,  search  time 
begins  to  increase.  That  observation 
was  also  made  by  Brody,  Corbin  and 
Volkman  ( i960) .  The  inflection  point 
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Figure  1 .  Search  Time  as  a  Function  of  the  Half  Number  of  Stimuli  in  the  Display 
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and  the  slope  appear  to  be  related  to 
the  ease  with  which  the  target  can  be 
discriminated  from  its  background.  The 
more  difficult  the  discrimination,  the 
smaller  the  number  of  stimuli  at  the  inflec¬ 
tion  point,  and  the  steeper  the  rise  after 
the  inflection  point. 

The  data  of  Lehtio  are  completely 
out-of-line  with  any  of  the  trends.  His 
data  are  particularly  interesting,  however, 
since  they  suggest  a  different  kind  of 
search  variable.  That  is,  in  all  of  the 
studies  shown,  excepting  Lehtio's,  the 
subject  could  arrive  at  a  decision  by 
finding  a  stimulus  which  was  different 
from  the  rest  in  any  respect.  Since  in 
Lehtio's  study  the  stimulus  elements 
differed  with  regard  to  combinations  of 
two  attributes,  the  subject  had  to  find 
the  target  in  terms  of  its  unique  attri¬ 
bute.  After  that,  he  may  or  may  not  have 
also  acted  selectively  within  the  target. 

If  we  give  credence  to  the  difference  in 
Lehtio's  results,  it  is  reasonable  to 
infer  that  his  subjects  were  required  to 
search  for  a  specific  attribute  difference 
in  an  array  of  differences  and,  therefore, 
to  inspect  each  stimulus  as  an  individual 
source  of  information  whereas  in  all  other 
studies  shown,  the  search  was  for  any  dis- 
homogeneity  in  the  array.  The  subjects 
of  those  other  studies,  therefore,  could 
have  swept  the  array  in  a  fast  scan. 

Results  with  simultaneous  targets,  reported 
by  Eriksen  ( 1962,  1953)  provide  additional 
support  for  this  interpretation  of  Lehito's 
data.  Thus,  two  different  scanning 
strategies  are  suggested.  One  requires 
element  by  element  inspection;  the  other 
allows  for  sweeping  across  the  array.  If 
they  are  thought  of  as  opposite  ends  of  a 
continuum,  then  variations  of  strategy 
might  be  in  the  number  of  elements  to 
be  inspected  as  a  unit. 

A  second  dependent  measure  of  interest 
is  the  scanning  rate,  or  its  inverse 
^alue,  time  per  element  of  the  display, 
where  an  element  is  one  symbol  or  stimuulus 
Time  per  element  was  calculated  for  the 
studies  shown  in  Figure  1  and  plotted  in 
Figure  2.  The  figure  shows  very  clearly 
that  on  a  log-log  plot  this  dependent 
measure  decreases  linearly  as  the  number 
of  stimuli  in  the  array  increases.  The 
generality  of  the  dependent  measure  is 
limited,  of  course,  since  the  arithmetic 


involved  would  produce  a  decreasing 
measure  even  if  the  search  time  were 
constant  over  the  whole  range  of  the 
abscissa.  That  was  not  the  case,  al¬ 
though  Figure  1  suggested  that  the 
search  time  may  be  constant  up  to  a 
critical  number  of  stimuli. 

Figure  2  indicates  that  the 
observer  appears  to  be  capable  of 
processing  discrete  elements  at  a 
rate  of  less  than  .0007  second  per 
element.  If  rates  of  that  order  are 
within  the  individual's  processing 
rate,  then  Figure  2  may  be  interpreted 
as  conceivably  representing  an  increase 
in  the  speed  of  serial  processing. 

On  the  other  hand,  to  the  extent  that 
the  values  are  unrealistic.  Figure  2 
presents  an  artifact  due  to  the  arith¬ 
metic  involved  in  calculating  the 
dependent  measure.  Ignoring  this 
question  for  the  moment,  and  assuming 
that  Figure  2  is  meaningful,  the 
figure  indicates  that  as  the  number  of 
stimuli  in  the  display  increases,  the 
time  spent  per  element  decreases  to  a 
limiting  value  after  which  it  remains 
constant.  The  limiting  value  appears 
to  depend  upon  the  nature  of  the  stimuli 
used.  In  fact  essentially  the  same 
limit  appears  to  have  been  reached 
for  the  two  middle  trends  of  the  figure. 
Inspection  of  Figure  1  shows  that 
search  time  increased  with  a  positive 
acceleration  so  that  both  trends  are 
already  almost  asymptotic  to  an  infinite 
search  time.  Looking  at  both  Figure  1 
and  Figure  2  for  those  studies,  it 
seems  clear  that  the  increasing 
search  time  is  actually  due  to  an 
increasing  number  of  stimuli  to 
process  at  a  constant  time  per  stimulus. 

Figure  2  can  be  approached  with 
more  than  one  hypothesis.  It  can  be 
supposed  that  after  some  critical 
number  of  stimuli,  the  observer  shifts 
from  serial  to  parallel  processing. 

The  two  extreme  trends  at  the  high 
end  of  the  abscissa  would  then  sug¬ 
gest  that  for  tho®  tasks  the  shifting 
point  had  not  been  reached.  The  flat 
portions  of  those  results  could  be 
interpreted  as  what  happens  when  the 
shift  occurs.  The  same  change,  unfor¬ 
tunately,  is  equally  plausible  as  what 
happens  when  there  is  a  shift  in  the 
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Figure  2.  Time  per  Displayed  Stimulus  Element  (symbol)  as  a  Function  of  the  Half 

Number  of  Stimuli  in  the  display. 


grouping  or  coding  of  multiple  elements 
in  the  display.  That  is,  as  Vol,mann  et 
al  (196*+)  have  suggested,  the  individual 
inspects  each  element  of  the  display 
until  a  critical  display  number  is  reached 
after  which  the  stimuli  are  scanned  in 
groupings  of  some  sort. 

Volkmann  et  al  (196*+)  have  reported 
that  as  the  density  of  the  stimuli  increased 
in  their  experiments,  the  search  time 
also  increased.  Density  was  calculated  as 
the  total  number  of  stimuli  displayed 
divided  by  the  visual  angle  of  the  field 
in  degrees.  Our  plots  of  search  time  as 
a  function  of  density  and  time  per  element 
as  a  function  of  density  produced  graphs 


which  were  very  similar  to  Figures  1 
and  2.  For  that  reason  only  the  latter 
will  be  presented. 

Figure  3  relates  time  per  element  to 
density.  As  noted  the  figure  closely 
approximates  Figure  2.  One  important 
discrepancy  should  be  noted,  however. 

Of  the  four  studies  which  used  alpha¬ 
numeric  displays,  three  used  identical 
densities  and  one  was  almost  at  that  den¬ 
sity.  Whereas  in  Figure  2,  three  of 
those  studies  were  close  to  the  trends 
of  the  other  data  and  one  was  not,  all 
four  are  now  away  from  those  trends. 
Furthermore,  those  four  studies  may 
now  be  seen  to  have  provided  a  wide  range 
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of  values  of  time  per  element.  Thus,  at 
least  for  the  data  being  used.  Figure  3 
does  not  provide  support  for  density  as  an 
independent  variable.  Instead  it  suggests 
that  it  may  appear  to  be  such  a  variable 
only  because  of  a  spurious  correlation 
with  the  dependent  measufe  via  its  own 
dependence  on  number  of  stimuli. 

The  data  shown  in  the  previous 
figures  were  based  upon  searching  for  a 
known  target  when  there  were  no  alterna¬ 
tive  target  possibilities.  Figure  k  pre¬ 
sents  search  time  as  a  function  of  the 
number  of  possible  different  targets  that 
could  appear  when  only  one  of  those  tar¬ 
gets  is  in  the  display.  The  parameter  is 
the  half  number  of  stimuli  in  the  display. 

It  may  be  seen  that  for  a  constant  number 
of  displayed  stimuli,  search  time  increases 
as  the  number  of  possible  targets  increases. 
However,  the  data  are  not  very  well-ordered 
in  terms  of  the  parameter,  number  of  stim¬ 
uli. 

Figure  5  relates  time  per  element 
to  the  number  of  possible,  different 
targets  for  the  same  data.  With 
regard  to  the  parameter,  although  the 


ordering  is  far  from  completely  con¬ 
sistent,  it  is  more  systematic  than 
it  was  on  the  previous  figure.  It 
is  also  interesting  that  the  largest 
arrays  are  now  at  a  constant  time 
per  element,  i.e.  they  are  indepen¬ 
dent  of  the  number  of  possible  tar¬ 
gets.  The  small,  8-element  array,  on 
the  other  hand,  still  shows  an  increas¬ 
ing  function  of  the  number  of  possible 
targets. 

Figure  5  is  a  little  misleading 
in  that  time  per  element  is  plotted 
to  the  nearest  .01  second  whereas  both 
Cohen  and  Pew  (1970)  and  Neisser, 

Novick  and  Lazar  ( 1963)  reported 
their  data  to  the  nearest  .001  sec. 

At  that  greater  level  of  precision, 
their  data  also  show  an  increasing 
time  per  element  as  the  number  of 
possible  targets  increases.  The 
data  being  shown  from  those  two  studies 
are  from  the  fifth  day  of  practice. 

The  data  of  Kaplan  and  Carvellas 
(1965)  are  based  upon  any  two  runs 
through  the  series  in  one  experimen¬ 
tal  session.  By  the  tenth  day 
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Figure  3.  Time  per  Element  as  a  Function  of  Stimulus  Density 
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Neisser,  Novick,  and  Lazar  found  that 
the  time  per  element  was  the  same 
regardless  of  the  number  of  possible 
targets  at  least  up  to  10.  Cohen  and 
Pew  did  not  find  that  result  through 
15  days  of  practice  although  with 
succeeding  days  the  differences  in 
time  per  element  associated  with  the 
number  of  possible  targets  became 
markedly  less.  Presumably  the  differ¬ 
ences  between  those  two  studies  are 
due  to  the  major  experimental  differ¬ 
ence  in  procedure,,  i.e.  instructions 
which  emphasized  speed  (Neisser, 

Novick  and  Lazar)  or  which  emphasized 
accuracy  (Cohen  and  Pew).  Regardless, 
the  results  suggest  that  with  enough 
practice  the  effects  of  differing 
numbers  of  possible  targets  are  lost. 

This  suggestion  finds  very  strong 
support  in  studies  of  choice  reaction 
time  which  show  the  same  phenomenon 
although  only  after  much  greater 
amounts  of  practice  (e.g.  Mowbray 
and  Rhoades,  1959) • 

The  visual  search  situation  may  also 
be  analyzed  in  information  theoretic 
terms  and  this  has  been  done  for  the  data 
of  Green  and  Anderson  (1956)  by  Meudell 
and  Whiston  (1970)  with  considerable 
success.  Such  an  analysis  may  be  based 
upon  two  different  underlying  search 
models.  The  first  assumes  that  the 
individual  is  a  random  searcher.  The 
presentation  of  elements  of  the  display 
to  the  searcher  t;hen  depends  upon  a 
random  positioning  of  the  eyes  during 
scanning.  In  fact,  data  from  studies  of 
eye  movements  during  search  suggest 
that  although  the  pattern  of  eye  move¬ 
ments  may  be  erratic  it  is  far  from  ran¬ 
dom.  A  second  factor  involved  in  the 
particular  studies  that  we  are  consider¬ 
ing  is  that  in  all  of  the  studies  using 
alphanumeric  symbols  the  subjects  were 
instructed  to  search  sequentially  in 
the  same  manner  as  used  for  normal 
reading.  Furthermore,  it  is  a  reasonable 
hypothesis  that  the  eye  movement  sequenc¬ 
ing  in  the  other  studies  was  at  least 
strongly  influenced  by  reading  habits. 

On  this  basis  a  random  search  model 
seems  less  likely  to  represent  what 
happens  than  a  model  which  assumes  a 
systematic  search.  In  fact,  that  was 
the  case  in  our  analysis  of  these  data 
and,  therefore,  only  the  results  obtained 
with  the  systematic  search  model  will 
be  presented. 


A  systematic  search  model  assumes 
that  the  subject  inspects  each  element 
of  the  display  in  serial  order.  On 
inspection  he  makes  a  one  bit  decision, 
i.e.  that  the  element  is  either  in  the 
target  or  in  the  non-target  classifica¬ 
tion.  If  it  is  in  the  target  classifica¬ 
tion,  then  he  must  make  a  further  decision 
about  which  target  of  the  possible  alter¬ 
native  targets  it  is.  In  general,  the 
amount  of  information,  H(X),  involved  is: 

H(X)  =  N  +  m  log?n  (l) 

where  N  is  the  number  of  possible  loca¬ 
tions  or  the  number  of  symbols  in  the 
display  if  all  possible  locations  are 
used. 

m  is  the  number  of  possible  target 
alternatives 

n  is  the  number  of  elements  or  items 
in  the  source  of  targets  and  non¬ 
targets 

As  noted  by  Meudell  and  Whiston 
(1970)  when  the  probabilities  of  the 
individual  items  in  the  source  vary, 
the  more  accurate  calculation  of  the 
amount  information  is : 

H(X)  =  N  +  m  s  P.log  l/p  (2) 
i=l 

where  p^  is  the  probability  per 
source  element.  However,  as  they 
also  note,  and  for  the  data  involved 
here,  the  error  involved  in  using 
Equation  1  when  the  probabilities 
are  unequal  is  very  small.  Further¬ 
more,  for  the  present  data  it  is  not 
always  possible  to  be  sure  of  what 
the  probabilities  should  be.  For 
example,  in  some  alphanumeric  studies 
sources  were  established  for  experi¬ 
mental  purposes  which  were  selec¬ 
tions  of  letters  or  letters  and  digits. 
Displays  were  then  constructed  from 
those  sources.  But,  it  is  reasonable 
to  assume  that  the  subjects  viewed 
those  sources  as  if  they  contained 
all  26  letters  or  all  letters  and 
digits,  and  as  if  all  source  elements 
were  equi-probable .  The  analysis 
to  follow  made  that  kind  of  subject- 
oriented  assumption. 

Figure  6  represents  search  time 
as  a  function  of  stimulus  information 
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for  all  of  the  data  of  the  previous 
figures  and,  in  addition,  for  the 
results  of  three  additional  studies 
fll  of  which  involved  more  than  one 
possible  target.*  Again,  it  may  be 
seen  that  there  is  an  increase  in 
the  dependent  measure  as  the  indepen¬ 
dent  measure  increases  in  magnitude 
and  that  more  than  one  trend  is  evident. 
Also,  although  the  calculation  accounted 

*Egeth,  Jonides  and  Wall  (1972)  pre¬ 
sented  an  array  of  letters  within  which 
was  placed  a  single  digit  as  the  tar¬ 
get.  The  subject  was  instructed  to 
report  any  digit  as  a  target.  Thus, 
the  search  was  not  for  one  of  several 
possible  digits,  but  of  digit  as  a 
category.  Accordingly,  although  the 
Investigators  selected  from  eight 
digit  possibilities,  we  viewed  the 
stimulus  source  as  a  one-bit  source, 
i.e.  letter  or  digit. 


for  variations  in  the  number  of  possible 
targets,  the  results  of  Cohen  and  Pew 
do  not  fall  on  any  of  the  trends.  Again, 
this  is  presumably  because  of  the  emphasis 
on  accuracy  which  they  employed.  Except 
for  the  data  of  Lehtio,  as  before,  there 
is  a  general  organization  of  the  data  into 
what  appear  to  be  three  major  trends.  It 
is  difficult  to  see  what  distinguishes 
among  the  trends,  however. 

Figure  7  presents  a  plot  of  the  data 
showing  the  relationship  between  time 
per  element  and  amount  of  stimulus  infor¬ 
mation.  The  measures  are  plotted  to  the 
nearest  .01  sec.  which  accounts  for  the 
large  number  of  points  which  are  close  to 
zero  seconds.  Again  the  data  of  Lehtio 
are  out  of  line  with  the  rest  of  the  data. 
Now,  however,  it  can  be  seen  that  all  of 
the  other  data,  with  the  exception  of 
those  of  Kaplan  and  Carvellas,  fall  on 
what  appears  to  be  a  single  trend.  The 


110 

10.  D 

9.D 

8.0 

7.0 

G 

W 

£  6.0 

P 

x 


§ 


5.0  -+ 
4.0 

3.0 

2.D 

l.D 


DIGIT  • 


SIZE  • 


V  KAPLAN  &  CARVELLAS.  1965 

■  NEISSER,  NOVICK.  &  UZAR,  1963 

(LETTER  AMONG  LETTERS) 

(LETTERS  AND  DIGITS) 

<>  NEISSER,  1963 

A  COHEN  &  PEW,  1970 

(LETTER  AMONG  LETTERS) 

(LETTERS  AND  DIGITS) 

□  V0LKMANN.  CORBIN.  EDDY,  &  COONLEY,  1964 

YEGETH,  JONIDES,  &  WALL.  1972 

(BLACK  TRIANGLE  AMONG  BLACK  CIRCLES) 

(DIGIT  AMONG  LETTERS) 

HVOLKMANN,  CORBIN.  EDDY,  &  COONLEY,  1964 

•  LEHTIO,  1970 

(RED  CIRCLE  AMONG  GREEN  CIRCLES 

(DIGIT,  COLOR,  SIZE) 

A  V0LKMANN,  CORBIN,  E00Y,  &  COONLEY,  1964 

♦  ERICKSON,  1964 

(URGE  BUCK  CIRCLE  AMONG  SMALL 

(UNDOLT  RING  AMONG  CIRCLES) 

BUCK  CIRCLES 

A  CRAWFORD,  1962 

O  SMITH,  1961 

(OPAL  VS.  RED  AND  GREEN) 

(SQUARE  AMONG  CIRCLES) 

®  BRODY,  CORBIN  &  V0LKMANN,  I960 
(BUCK  TRIANGLE  AMONG  BUCK 
CIRCLES) 

O 

0 


& 


•  COLOR 


4^3 

V 


4  42. .  • 

<^,8  POSSIBLE 


O* 

■  B 


O*-1 


5  POINTS 

i - r 

0  20 


40 


6D 


80 


“4 - H 


100 


120  140  160  180  200 

STIMULUS  INFORMATION  (BITS) 


H - H- 


300 


■MM 


-4*4- 


500  600  700  800  900  1000  2000 


Figure  6.  Search  Time  as  a  Function  of  Stimulus  Information  Content 
Numerals  Refer  to  the  Number  of  Possible  Different  Targets 


159 


L0-- 


V  KAPLAN  &  CARVEUAS,  1965 
(LETTER  AMONG  LETTERS! 

O  NEISSER,  1963 

(LETTER  AMONG  LETTERS! 

□  VOLKMANN,  CORBIN,  EDDY,  &  COONLEY,  1964 
(BLACK  TRIANGLE  AMONG  BLACK  CIRCLES 
H  VOLKMANN,  CORBIN,  EOOY,  &  COONLEY,  1964 
(REO  CIRCLE  AMONG  GREEN  ClRCLESI 
A  VOLKMANN,  CORBIN,  EOOY,  &  COONLEY,  1964 
(LARGE  BLACK  CIRCLE  AMONG  SMALL 
BLACK  ClRCLESI 
O  SMITH,  1961 

(SQUARE  AMONG  CIRCLES) 


■  NEISSER,  NOVICK,  &  LAZAR,  1963 
(LETTERS  AND  DIGITS) 

▲  COHEN  &  PEW,  1970 

(LETTERS  ANO  OIGITS) 

▼  EGETH,  JONIDES,  &  WALL,  1972 
(DIGIT  AMONG  LETTERSI 
•  LEHTlti,  1970 

(DIGIT,  COLOR,  SIZEI 
+  ERICKSON,  1964 

(LAN DOLT  RING  AMONG  CIRCLES) 
A  CRAWFORD,  1962 

(OPAL  VS.  REO  ANO  GREEN) 

®  BRODY,  CORBIN  &  VOLKMANN,  I960 
(BLACK  TRIANGLE  AMONG  BLACK 
ClRCLESI 


•  OIGIT 

•  SIZE 


•  COLOR 


1  2  3  4  5 

-pft  V  4  K 


TTT  a 


+ 

0 


i 


4- 

60 


1 


* 


WO  160  li  200  2^0  2^0  A  3^0  40D^  5(Jo  6^0  7(io  8(^0  900  loIxHSOO  2000 

STIMULUS  INFORMATION  (BITSI 


Figure  7.  Time  per  Element  as  a  Function  of  Stimulus  Information  Content 
Numerals  Refer  to  the  Number  of  Possible  Different  Targets 


smooth  line,  drawn  by  eye,  ignores  the 
results  of  Kaplan  and  Carvellas  and  of 
Lehtio.  The  function  shows  an  extremely 
rapid  decrease  in  time  per  element  to  35 
bits  of  stimulus  information  at  which  point 
the  time  per  element  is  .02  seconds.  The 
slight  decrease  in  time  per  element  after 
that  is  again,  presumably,  the  artifact 
noted  earlier  in  relation  to  Figure  3* 

This  result  suggests  a  maximum  rate  of  50 
elements  per  second  at  a  display  complex¬ 
ity  of  35  bits,  assuming  serial  searching. 
It  should  also  be  noted  that  again  the 
results  of  Lehtio  are  out  of  line,  though 
closer.  We  have  no  explanation  for  the 
reversal  in  trend  of  the  data  of  Kaplan 
and  Carvellas. 

Figure  8  presents  the  scanning  rate 
as  a  function  of  stimulus  information. 

The  points  were  obtained  by  reading  the 
smooth  line  of  the  previous  figure  and 
converting  the  values  obtained  to  their 
reciprocals.  The  figure  confirms  the 
conclusion  of  the  previous  figure,  but 
with  additional  emphasis.  That  is,  the 
curve  rises  steeply  to  5 O  elements  per 
second  at  which  point  it  reaches  a 


limit.  After  that  it  is  constant  to  about 
55  bits  where  it  begins  to  rise  again.  Had 
we  been  willing  to  plot  time  per  element 
to  the  fourth  decimal  place,  it  would 
have  been  rising  slowly,  but  continu¬ 
ously  throughout.  Thus,  the  rise  begin¬ 
ning  at  55  bits  represents  a  change 
detectable  at  the  second  decimal  place. 

But  as  we  have  suggested  repeatedly, 
the  mere  addition  of  large  numbers  in 
the  denominator  of  the  quantity,  search 
time/half  number  of  stimuli  =  time  per 
element,  will  produce  continuously 
decreasing  times  per  element  even  if 
search  time  is  constant.  Thus,  we 
cannot  accept  the  practice  of  reporting 
time  per  element  to  the  third  and 
fourth  decimal  places  as  the  measure 
then  becomes  over  sensitive  to  the 
artifact.  Accordingly,  we  interpret 
Figure  8  as  suggesting  a  maximum 
scanning  rate  or  scanning  capacity 
of  50  elements  per  second.  It  should 
also  be  noticed  that  the  rate  increases, 
the  more  the  information  to  be  process¬ 
ed  up  to  that  limit.  Finally,  in 
this  connection,  the  curve  appears 
to  be  discontinuous  at  the  lower  end, 
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i.e.  the  rates  for  two-and  three- 
bit  stimulus  situations  do  not  fall 
neatly  on  the  remaining  trend.  We 
have  omitted  them  from  the  smooth 
line,  therefore.  But  whether  this 
result  should  be  viewed  as  error 
or  as  representing  a  different  phen¬ 
omenon  is  not  suggested  by  the  analysis. 

The  error  rate  of  Neisser  (1963) 
and  of  Neisser,  Novick  and  Lazar  ( 1963) 
exceeded  our  five  percent  criterion 
for  the  acceptance  of  experimental 
studies  by  a  few  percent.  Assuming 
that  the  error  rates  of  all  of  the 


studies  are  really  very  small  of  negli¬ 
gible,  and  by  assuming  that  element 
information  is  what  is  being  trans¬ 
mitted,  the  line  of  Figure  8  can  be 
transformed  to  represent  the  rate  of 
information  transmission.  The  result  of 
that  transformation  is  shown  in  Figure  9. 

The  line  of  Figure  8  was  trans¬ 
formed  over  the  range  from  4-35 
bits  of  stimulus  information  and  drawn 
as  the  solid  line  in  Figure  9*  The 
two  unconnected  points  in  the  figure 
are  transformations  of  the  two-  and 
three-bit  values  of  Figure  8.  The 


Figure  8.  Scanning  Rate  as  a  Function  of  Stimulus 
Information  Content 


Figure  9.  Information  Processing  Rate  as  a  Function  of 
Stimulus  Information  Content 
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dashed  line  is  an  extrapolation  of  the 
solid  line  to  zero. 

Again  we  cannot  account  for  the 
deviation  of  the  two  and  three-bit 
values  from  the  main  trend.  In  spite  of 
that  it  is  clear  that  the  processing  rate 
increases  as  the  stimulus  information 
content  increases.  The  line  indicates  a 
maximum  processing  rate  or  capacity  of 
5-64  bits  per  second. 

DISCUSSION 

In  spite  of  the  small-  number  of 
experiments  available  which  met  our 
requirements,  certain  empirical  relation¬ 
ships  appear  to  have  emerged  from  the 
data  and  some  theoretical  notions  appear 
to  be  suggested.  On  the  empirical  side, 
the  following  conclusions  appear  to  be 
warranted : 

1.  Search  time  for  a  target  in  an 
array  increases,  but  the  rate  of 
searching  also  increases  (and 
time  per  element  decreases)  as 

the  number  of  stimuli  in  the  array 
increases.  The  increase  in 
search  time  is  slow  or  even 
negligible  at  first  and  then 
increases  rapidly.  The  inflec¬ 
tion  point  and  the  rate  of  rise 
after  it  may  depend  upon  the 
difficulty  of  discrimination 
between  target  and  non-target 
elements.  However,  search  rate 
varies  directly  with  the  number 
of  stimuli  in  the  display  until 
a  maximum  rate  is  reached.  The 
maximum  rate  depends  upon  the 
same  factors  as  those  which  con¬ 
strain  search  time. 

2.  Search  time  increases  as  the 
number  of  alternative  possible 
targets  increases.  Up  to  some 
level  of  practice,  the  rate  of 
searching  also  increases  with 
the  number  of  Possible  targets. 
After  that  level  of  practice  is 
reached,  the  number  of  alterna¬ 
tive  targets,  at  least  up  to  10, 
probably  makes  no  difference  in 
search  rate. 

3.  Search  time  is  greater,  when  the 
emphasis  is  put  by  the  searcher 


on  accuracy  of  target  selec¬ 
tion. 

4.  For  the  kind  of  situations 
involved,  density  (number  of 
stimuli  per  unit  visual  area) 
may  be  only  spuriously 
related  to  search  performance. 

5*  Time  per  element  decreases 
as  the  amount  of  stimulus 
information  in  the  display 
increases  to  a  minimum  time 
per  element  of  about  .02 
second.  Concurrently, 
scanning  rate  increases  to  a 
maximum  of  50  elements  per 
second. 

6.  The  rate  of  information  trans¬ 
mission  increases  with  increas 
ing  stimulus  information  to 

a  limit  of  5* 6  bits  per  second 

7.  It  is  possible  that  the 
effects  of  stimulus  infor¬ 
mation  content  on  scanning 
rate  and  information  trans¬ 
mission  are  quantitatively 
different  below  four  bits  of 
stimulus  information  from 
what  it  is  above  four  bits. 

On  the  theoretical  side  it  seems 
apparent  that  a  systematic  serial 
search  model  may  be  appropriate  to 
data  of  the  kind  used.  Such  data  are 
from  studies  with  no  temporal  uncer¬ 
tainty,  with  well-defined  possible 
target  locations,  but  with  uncertainty 
about  the  location  at  which  the  target 
will  be.  The  results  of  Meudell  and 
Whiston  (1970)  also  indicate  that  the 
sequential  search  model  holds  for 
more  complex  alphanumeric  search 
tasks.  The  results  also  make  it 
fairly  clear  that  search  time  as  a 
measure  is  less  useful  than  scan 
rate  since  search  time  depends  not 
only  on  the  rate,  but  on  the  number 
of  positions  or  symbols  to  be  inspected 

It  also  appears  that  the  conclu¬ 
sions  suggested  are  restricted  to 
cases  where  the  target  differs  from 
non-target  stimuli  by  a  single 
stimulus  attribute,  and  where  only 
one  target  is  present  in  the  display. 
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Where  attributes  vary  beyond  that,  scan 
rate  and  search  time  are  considerably 
slower . 

There  has  been  a  great  deal  of  inter¬ 
est  lately  in  the  question  of  whether  the 
search  for  multiple  target  possibilities 
is  carried  out  in  a  serial  or  parallel 
fashion.  Neisser,  Novick  and  Lazar  (1963) 
have  suggested  a  serial  process  during 
learning  of  the  task,  but  a  parallel 
process  with  sufficient  practice.  The 
basis  of  their  conclusion  is  their  finding 
that  with  sufficient  practice  the  rate  of 
searching  became  independent  of  the  number 
of  target  possibilities.  If  the  data 
are  used  only  to  the  nearest  .01  second 
even  the  data  of  Cohen  and  Pew  (1970) 
support  that  finding,  and  as  we  have 
already  noted,  such  a  finding  is  consistent 
with  the  effects  of  multiple  alternatives 
in  choice  reaction  time  studies  after 
enough  practice.  Whether  or  not  parallel 
processing  is  involved  is  not  suggested 
by  our  analysis.  However,  the  analysis 
does  suggest  a  scanning  capacity  of  50 
elements  per  second  for  the  conditions 
studied.  The  question  now  arises  as  to 
whether  that  rate  is  achieved  because  of 
the  availability  of  parallel  processing 
or  whether  a  moderately  high  speed  sequencer 
is  involved.  In  any  case,  since  the  data 
represent  the  simplest  cases,  and  therefore, 
since  other  search  situations  involving 
symbols,  shapes,  etc.  will  be  slower,  that 
rate  is  suggested  as  the  maximum  scanning 
rate  to  be  used  for  practical  purposes. 

Although  we  really  have  only  the  one 
study  of  Lehtio  to  go  on,  when  those 
results  are  compared  with  the  rest,  there 
is  the  suggestion  of  a  difference  in 
searching  strategy  discussed  earlier. 

That  is,  earlier  we  attempted  to  distin¬ 
guish  between  a  one-step,  between -element 
search  and  a  two-step,  both  between-and 
within-elements  search.  It  might  be 
supposed  that  which  will  be  used  will 
depend  upon  the  degree  of  homogeneity 
of  the  stimuli.  If  so,  the  results  do 
not  suggest  that  homogeneity  in  a  physical 
sense  is  what  is  involved  although  it 
could  be  an  aiding  factor.  This  appears 
to  be  the  case  since  the  physically  heter¬ 
ogenous  alphanumeric  arrays  yielded  search 
times  of  the  same  order  as  one  of  the  highly 
homogeneous  arrays. 

A  more  interesting  hypothesis  is  of  a 
psychological  homogeneity  which,  for  highly 


familiar  stimuli,  may  be  determined 
by  the  degree  to  which  stimuli  fall 
into  the  same  psychological  category. 
Accordingly  searching  for  a  letter  among 
letters  is  more  heterogeneous  than 
searching  for  a  digit  among  letters. 

It  is  interesting  to  note  that  in  the 
study  by  Egeth,  Jonides  and  Wall  (19^2), 
the  target  could  be  any  digit.  The 
display  was  one  digit  in  an  array  of 
letters.  That  this  study  produced 
data  which  were  among  the  shortest 
search  times  reported  could  be  viewed 
as  searching  in  a  two-classification 
array  for  digits  vs.  letters,  each 
letter  being  psychologically  the 
same  as  any  other  letter,  each  digit 
being  psychologically  the  same  in  its 
own  category  as  target.  The  point  is 
that  once  coded,  digit  vs.  letters 
is  psychologically  no  more  heterogeneous 
than  triangle  vs.  circles  even  though 
the  difference  in  physical  hetero¬ 
geneity  is  very  large.  Our  analysis 
indicates  strongly  that  it  is  the 
psychological  heterogeneity  which  is 
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ABSTRACT 

Current  high-performance  aircraft 
achieve  speeds  which  exceed  man's  ability 
to  visually  acquire  a  target  before  it  is 
overflown.  We  are  faced  with  the  problem 
of  extending  man's  capability  to  match 
that  of  the  aircraft.  Electronic  sensors 
such  as  radar  and  TV  systems  offer  poten¬ 
tial  solutions,  however,  considerable 
research  into  the  best  utilization  of 
these  systems  is  still  required.  McDonnell 
Aircraft's  approach  to  this  research  has 
been  to  design  and  build  a  variable  config¬ 
uration  display  mockup  (VCDM)  in  which 
actual  flight  images  can  be  electro- 
optically  manipulated  to  measure  perfor¬ 
mance  and  evaluate  system  improvements. 

To  maintain  our  operational  orientation, 
stimulus  imagery  was  generated  using  aerial 
imagery  from  the  McDonnell  Douglas  Recon¬ 
naissance  Laboratory  data  base,  containing 
strategic  and  tactical  targets. 

Moving  Wirdow  Displays 

In  these  systems,  an  area  on  the 
ground  is  imaged  and  moves  down  the  display 
at  a  rate  proportional  to  the  speed  of  the 
aircraft.  A  series  of  studies  was  per¬ 
formed  to  evaluate  the  effects  of  image 
motion  generated  by  various  aircraft  speeds. 
Imagery  was  viewed  on  the  5.5  inch  CRT  dis¬ 
play  in  the  cockpit  station  of  the  VCDM, 
at  image  motion  rates  yielding  from  1  to  6 
seconds  on  the  display  and  simulated  air¬ 
craft  speeds  of  from  675  to  114  knots. 
Overall,  the  data  indicates  that  a  moving 
display  can  be  used  for  the  acquisition  of 
"easy"  targets  (large  truck  parts,  fortifi¬ 
cations,  storage  area)  at  speeds  resulting 
in  as  little  as  one  second  on  the  display. 


For  all  practical  purposes,  the  relation¬ 
ship  between  image-motion  and  time-on- 
display  limits  the  use  of  moving  window 
displays  to  missions  against  large  tar¬ 
gets  or  those  allowing  slow  aircraft 
speeds . 

Stabilized  Image  Displays 

In  these  systems  the  sensor  tracks  an 
area  on  the  ground  so  that  a  stationary 
image  is  displayed  to  the  operator.  As 
the  aircraft  closes  with  the  target  the 
image  scale  increases,  producing  a  zoom 
effect.  Since  the  system  depends  on  track¬ 
ing  a  fixed  point,  it  can  only  be  used  for 
targets  of  known  location,  or  where  a 
specific  ground  area  needs  to  be  searched. 
The  variables  manipulated  were:  closing 
rate  (360  to  1200  knots),  starting  field 
of  view  (100%  vs  66%  of  original  image), 
target  offset  from  the  center  of  the  dis¬ 
play  (center,  middle,  edge)  and  type  of 
briefing  aid  (photo  or  sketch  of  target). 
This  series  of  studies  indicated  that  tar¬ 
gets  can  be  successfully  acquired  on  a 
stabilized  image  display  at  closing  rates 
as  high  as  1200  knots,  provided  the  navi¬ 
gation  system  is  adequate  to  place  the 
target  in  the  center  two-thirds  of  the 
field  of  view,  and  appropriate  briefing 
aids  are  available. 

In  summary,  at  moderate  speeds,  moving 
window  displays  are  effective  against  large 
targets,  while  stabilized  image  displays 
are  effective  against  targets  of  known 
location  at  much  higher  speeds.  This  data 
contributes  towards  the  definition  of  the 
aircraft  envelope  and  the  mission  types 
within  which  man  can  effectively  utilize 
these  sensor/display  systems. 
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SCOPE  OF  PROBLEM 

The  human  operator  places  severe 
limitations  on  the  usable  portion  of  the 
flight  envelope  of  high  performance  air¬ 
craft  engaged  in  air-to-ground  target 
acquisition.  These  limitations  are  cen¬ 
tered  around  difficulties  in  visual  acqui¬ 
sition  of  ground  targets.  Laboratory  stud¬ 
ies  indicate  that  a  target  size  of  from  10 
to  20  minutes  of  arc  is  required  for  ade¬ 
quate  (80*  or  better)  target  acquisition 
(Erickson,  Main  &  Burge,  1967;  Steedman  & 
Baker,  I960;  Boynton  &  Bush,  1955;  Levine, 
Jauer  &  Kozlowski,  1969).  Operationally 
this  translates  to  an  initial  visual  acqui¬ 
sition  range  of  3,300  to  10,000  ft.  for  a 
20-30  ft.  target.  Operational  and  field 
studies  obtained  initial  visual  acquisition 
ranges  of  from  5,000  to  11,000  ft.  for 
vehicles  in  the  20-30  ft.  size  range 
(Blackwell,  Omart  &  Harcum,  1959;  Dyer, 
1964;  Wade,  1964).  Allowing  for  the  dif¬ 
ference  in  target  acquisition  procedure, 
altitude,  and  velocity,  the  laboratory  and 
the  operational  results  are  in  good  agree¬ 
ment. 

Relating  these  acquisition  ranges  to 
operational  situations,  an  aircraft  with  a 
velocity  of  300  knots  will  cover  5,000  ft. 
in  approximately  10  seconds.  Allowing  for 
the  best  case  acquisition  range  (11,000 
ft.),  at  this  velocity  the  aircrew  will 
have  only  22  seconds  between  target  acqui¬ 
sition  and  target  flyover.  During  this 
time  the  aircraft  must  be  maneuvered  into 
firing  position,  the  target  must  be 
acquired  on  the  weapons  guidance  system, 
lockon  achieved  and  weapons  launch  ini- 
ti ated . 

An  unclassified  summary  of  time  line 
analyses  of  the  critical  weapons  delivery 
functions  for  a  number  of  mission  configu¬ 
rations  using  a  typical  lockon  before 
launch  missiles  showed  that  an  average  of 
19  seconds  was  required  for  launch  (Dolce, 
Whiteside  &  Wright,  1970).  Using  this 
data  as  a  boundary  condition,  man's  enve¬ 
lope  was  plotted  against  altitude  and 
velocity  to  allow  a  direct  comparison  with 
the  operational  envelope  of  a  typical  high 
performance  attack  aircraft  (Figure  1). 

This  comparison  clearly  shows  that  the  air¬ 
craft  potential  greatly  exceeds  man's 
capability.  The  designers  of  real-time 
and  near-real -time  reconnaissance  and 
reconnaissance/strike  systems  are  faced 
with  the  problem  of  extending  man's  target 


acquistion  capability  to  approach  the  air¬ 
craft  capability.  The  majority  of  the 
research  on  extending  man's  performance 
envelope  has  centered  on  real-time  and 
near-real-time  displays  utilizing  sensing 
aids  such  as  radar  and  long-focal  length 
electro-optical  systems.  These  systems 
significantly  extend  man's  target  acquisi¬ 
tion  range  and  have  the  added  benefits  of 
recording  an  expanded  energy  spectrum  and 
a  capability  for  automated  data  processing 
However,  they  also  bring  up  a  new  series 
of  problems  relating  to  system  utilization 
and  the  observer's  ability  to  acquire  tar¬ 
gets  from  the  displays  system  outputs. 
These  problems  can  be  summarized  as  the 
selection  of  sensor  systems  and  the  dis¬ 
play  of  their  outputs  in  a  way  which  maxi¬ 
mizes  target  acquisition  and  acquisition 
range  and  minimizes  time  to  acquire  and 
false  alarms.  Because  of  the  large  number 
of  variables  involved,  this  task  is  far 
more  difficult  than  it  would  seem.  Each 
element  in  the  system  can  be  analyzed  and 
evaluated  with  respect  to  its  own  set  of 
variables  and  to  some  extent  the  effects 
which  these  variables  have  on  the  system 
elements  interfacing  with  it  (Figure  2). 
For  example,  the  target/background  con¬ 
trast  on  the  ground  and  the  sensor  sensi¬ 
tivity  will  determine  sensor  output;  the 
sensor  output  and  the  display  capability 
will  determine  target/background  contrast 
on  the  display;  and  in  turn  the  target/ 
background  contrast  on  the  display  and  the 
observer's  visual  capability  will  deter¬ 
mine  the  perceived  target/brightness  con¬ 
trast.  The  acquisition  systems  may  be 
studied  by  relating  variables  from  any 
system  element  with  any  of  the  others,  how 
ever,  in  doing  this  all  intermediate  ele¬ 
ments  must  also  be  evaluated.  This 
approach  results  in  complex  analyses  whose 
results  are  often  equivocal  and  difficult 
to  apply. 

In  the  course  of  our  research  we  have 
evolved  three  concepts  to  aid  in  the  study 
of  the  acquisition  problem  and  reduce  some 
of  the  complexity. 

First,  the  experiments  should  concen¬ 
trate  on  the  display  image/observer  inter¬ 
face  (Figure  3).  This  interface  -  where 
the  eyeball  meets  the  display  -  has  ele¬ 
ments  of  both  the  hardware  and  observer 
elements  of  the  system.  The  observer's 
response  defines  system  performance  in 
terms  of  targets  acquired,  and  provides  a 
direct  measure  of  system  goodness.  The 
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display  image  represents  the  end  result  of 
the  hardware  part  of  the  system.  Once  a 
display  has  been  identified  which  produces 
adequate  target  acquisition  it  is  possible 
to  work  backwards  to  select  values  of 
other  hardware  elements  which  produce  the 
required  display.  This  yields  a  simpli¬ 
fied  display  image  -  performance  oriented 
solution  to  the  acquisition  problem. 


AIRCRAFT  F4J-LEVEL  FLIGHT 


*  BASEO  ON  19  SEC  ACTION  TIME  AND  MACH  NUMBER 
11  TWO  FT  ACQUISITION  RANGE. 


Figure  1.  Man's  Visual  Capability  Limits 
the  System 
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Figure  2.  Target  Acquisition 
Variables 


Second,  attempts  should  be  made  to 
identify  summary  metrics  or  compound  vari¬ 
ables  which  account  for  performance. 

These  variables  must  be  empirically  mea¬ 
surable  at  the  display  face  and  directly 
relatable  to  performance.  Our  research 
has  identified  one  such  summary  metric, 
resolvable  lines-over-target  (Figure  4) 
(Levine,  Jauer  &  Kozlowski,  1970).  This 
measure  is  derived  from  J.  Johnson's 
(1958)  observations  concerning  resolution 
requirements  for  target  recognition/ 
identification  and  consists  simply  of  the 
product  of  the  target  height,  in  milli¬ 
meters,  as  imaged  on  the  screen,  and  the 
number  of  line  elements  per  millimeter 
that  can  be  resolved  by  the  observer  view¬ 
ing  the  display  under  the  same  conditions 
as  would  occur  during  target  acquisition. 
Analyzing  target  acquisition  performance 
from  2  separate  in-house  experiments 
revealed  a  correlation  of  from  .81  to  .88 
between  performance  and  resolvable  lines- 
over-target  (Figure  5).  This  measure  seems 
to  offer  a  simple  and  effective  estimate  of 
expected  performance,  given  a  known  display 
situation  and  target  size.  In  concept,  it 
is  closely  related  to  the  MTFA  which  yields 
a  predictive  measure  of  overall  performance 
with  a  variety  of  target  sizes. 
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Figure  4.  Summary  Metric 


Figure  3.  Concentrate  on 
This  Interface 


Figure  5.  Summary  Metric 
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MTFA  is  derived  from  the  modulation 
transfer  function  (MTF)  of  the  display  and 
the  contrast  demand  function  of  the  human 
observer.  The  MTF  is  a  plot  of  the  resolv¬ 
ing  power  of  the  display,  with  respect  to 
sinusoidal  resolution  patterns,  at  various 
modulations  or  contrast  levels.  For  CRT 
displays  as  contrast  decreases  the  display 
can  image  progressively  smaller  resolution 
elements.  Engineers  have  found  it  very 
useful  because  the  MTFs  of  all  the  elements 
in  a  display  system  can  be  combined  mathe¬ 
matically  to  obtain  the  MTF  of  the  final 
display  without  actually  putting  the  system 
together.  The  contrast  demand  function 
(CDF)  is  a  measure  of  the  contrast  the 
observer  requires  to  discriminate  between 
resolution  elements.  For  the  observer,  as 
contrast  increases  the  ability  to  perceive 
progressively  smaller  resolution  elements 
increases.  Plotting  the  two  curves  against 
increasing  modulation  and  resolution  yields 
a  monotonically  decreasing  function  for  MTF 
and  a  monotonically  increasing  function  for 
CDF.  The  area  encompassed  by  these  two 
functions  is  the  MTFA,  and  represents  the 
resolutions  that  the  display  can  image  and 
the  eye  can  see,  at  various  contrast 
levels.  This  measure  is  used  to  predict 
the  goodness  of  a  display  with  respect  to 
a  population  of  targets,  like  those  found 
in  the  real  world.  It  is  a  predictor  of 
average  performance.  The  resolvable 
lines-over- target  measure  takes  a  single 
measure  of  the  display/observer  resolution 
capability  at  a  single  contrast  and 
relates  this  to  a  specific  target  charac¬ 
teristic,  target  height.  Both  MTFA  and 
resolvable-lines-over-target  are  based  on 
the  same  perceptual  principles,  discrimi- 
nable  resolution,  however,  the  former 
yields  a  general  prediction  of  display- 
observer  performance  while  the  latter 
makes  predictions  concerning  specific 
targets  . 

Third,  the  relationships  between 
experimental  findings  relating  to  the  dis¬ 
play/observer  interface  should  be  directly 
related  to  aircraft/sensor  system  param¬ 
eters  (Figure  6).  This  relationship 
should  be  explicit  and  couched  in  language 
appropriate  for  design  engineering.  While 
the  findings  of  specific  studies  are  some¬ 
times  only  general  approximations  of  the 
true  relationships,  they  are  often  the 
only  data  available  on  the  interrelations 
between  display  system  parameters  and  tar¬ 
get  acquisition  performance.  As  such, 
they  can  play  an  important  role  in  guiding 


the  thinking  of  engineers  responsible  for 
hardware  selection  and  design.  Just  as 
scientific  data  must  be  communicated  to 
the  scientific  community  to  become  "real," 
technological  data  must  be  communicated  to 
the  engineer  to  be  useful. 
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Figure  6.  Explicate  Display/System 
Observer  Relationships 

The  research  program  which  led  to 
these  recommendations  was  part  of  an 
internally  funded  series  of  studies  on  the 
display  problems  specific  to  long  range 
real-time  sensor  systems.  This  evaluation 
has  considered,  to  date,  both  moving  and 
stabilized  image  applications  of  such  sys¬ 
tems.  The  following  section  reports  on 
experiments  concerning  both  of  these  image 
display  systems. 

MOVING  IMAGE  DISPLAYS 

The  utilization  of  long  focal  length 
electronic  sensors  in  conjunction  with 
real-time  and  near-real-time  displays  will 
extend  man's  target  acquisition  range  by 
functionally  magnifying  the  scale  of  the 
terrain  to  be  searched.  This,  in  theory, 
should  allow  faster  aircraft  speeds  while 
maintaining  or  increasing  the  time  to  tar¬ 
get  flyover.  These  systems  are  deployed 
aimed  out  of  the  aircraft  at  some  fixed 
downward  look  angle.  Using  this  method  of 
deployment  the  image  motion  across  the  dis¬ 
play  is  proportional  to  the  aircraft  speed 
and  high  speeds  result  in  rapidly  moving, 
short  time  on  display  images.  In  order  to 
effectively  evaluate  the  constraints  this 
image  motion  places  on  the  system,  a  series 
of  studies  of  target  acquisition  and  image 
motion  were  conducted.  These  studies  con¬ 
sidered  target  difficulty,  target  cues  and 
target  reference  images,  and  their  inter¬ 
relationship  with  image  motion. 

Two  exploratory  experiments  were  con¬ 
ducted  to  determine  target  acquisition  of 
military  targets  with  natural  backgrounds. 
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Targets  typical  of  low  and  mid-intensity 
conflicts,  such  as  bridges,  truck  parks, 
fortifications,  and  AM  sites,  were- selec¬ 
ted  from  KA-80  high  panoramic  imagery  of 
Southeast  Asia.  Twenty-five  inch  strips 
containing  the  targets  were  cut  from  each 
frame  and  spliced  together  to  make  a  stim¬ 
ulus  film.  The  film  was  advanced  and  con¬ 
verted  to  a  video  signal  with  a  Flying  Spot 
Scanner,  built  by  the  McDonnell  Douglas 
Reconnaissance  Laboratory.  The  output  sig¬ 
nal  from  the  scanner  was  a  standard  525 
line,  2:1  interlace,  30  frame  per  second 
video  output.  The  subjects  viewed  this 
image  on  a  "9"  Conrac  TV  monitor  mounted 
in  a  simulated  cockpit  (Figure  7).  Image 
presentation  and  motion  were  kept  uniform 
through  automatic  control.  The  CRT  image 
was  5.5  Inches  high  with  a  scale  of  1:2500. 
The  image  motion  provided  a  total  target 
time-on-display  of  1,  2,  3,  4,  5,  and  6 
seconds  which  was  equivalent  to  aircraft 
speeds  of  675,  338,  227,  169,  135,  and  113 
knots,  respectively.  The  performance  mea¬ 
sure  was  percent  correct  identification  of 
the  targets.  Correct  identification  was 
achieved  by  the  subject's  correctly  indi¬ 
cating  that  the  target  was  on  the  display. 


Figure  7.  Cockpit  Mockup  and  Display 


In  the  first  experiment  24  targets  and 
12  subjects  were  used.  Based  on  target 
size,  type,  concealment  and  background 
clutter,  targets  were  divided  into  diffi¬ 
cult  and  easy  groups.  The  difficult  tar¬ 
gets  were  revetments,  trench  fortifications 
and  small  truck  parts  and  had  a  display 
size  of  approximately  1/5  inch.  The  easy 
targets  were  3/4  inch  on  the  display  and 
consisted  of  large  truck  parts,  forts,  and 
fortified  positions.  Prior  to  the  run, 
subjects  were  read  the  instructions  and 
given  a  review  of  the  target  keys.  Each 


target  had  a  key  with  a  print  of  the  target 
and  a  brief  written  description.  The  keys 
covered  only  the  target  and  the  inmediately 
adjacent  ground  area,  and  were  available 
to  the  subjects  during  the  entire  experi¬ 
ment.  The  results  are  presented  in  Figure 
8.  Performance  decreased  as  speed 
increased  for  both  the  hard  and  easy  tar¬ 
gets  at  motion  rates  greater  than  1.7  inch 
per  second.  At  slower  rates,  performance 
was  nearly  constant.  The  performance 
curves  indicate  that,  for  the  display  size 
and  scale  used,  viewing  times  greater  than 
three  seconds  did  not  significantly 
improve  performance.  The  three  second 
value  appears  to  be  Independent  of  target 
difficulty  as  the  curves  for  the  easy  and 
difficult  targets  are  nearly  Identical  in 
shape.  While  a  major  determiner  of  the 
absolute  level  of  performance,  target  diffi¬ 
culty  does  not  appear  to  interact  with  the 
other  display  variables.  This  would  indi¬ 
cate  the  possibility  of  improving  perfor¬ 
mance  based  on  the  target  difficulty-image 
rate  tradeoff  curve.  For  example:  a  "dif¬ 
ficult"  target  at  a  given  scale  might  be 
viewed  at  twice  the  scale.  This  would 
decrease  the  viewing  time  to  one-fourth  the 
original,  i.e.,  from  6.0  to  1.5  seconds. 
However,  if  the  scale  increase  transferred 
the  target  category  from  "difficult"  to 
"easy,"  the  performance  would  be  increased 
from  55%  to  80%.  Additional  studies  vary¬ 
ing  image  scale  are  needed  to  fully  eval¬ 
uate  this  possibility. 
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Figure  8.  Study  One  Moving 
Window  Display 


In  the  second  experiment  ten  addi¬ 
tional  difficult  targets  were  selected  from 
imagery  similar  to  that  used  in  the  first 
experiment.  All  ten  targets  were  of  the 
same  approximate  size  and  shape;  however, 
five  of  them  were  situated  near  prominent 
terrain  features  such  as  roads  or  rivers. 
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The  other  five  targets  had  no  unique  geo¬ 
graphical  features  to  aid  in  their  acqui¬ 
sition.  Twelve  subjects  were  used.  Prior 
to  each  run,  the  subjects  were  given  a  one 
minute  period  to  study  target  briefing 
photos.  Each  set  of  briefing  photos  con¬ 
sisted  of  an  image  of  the  actual  target 
area  and  images  of  two  areas  very  similar 
to  the  target.  Unlike  the  first  experi¬ 
ment  the  briefing  material  was  not  avail¬ 
able  during  the  test  run.  Image  motion 
rates  of  675,  227,  and  135  knots  were 
used.  All  other  factors  were  the  same  as 
in  the  first  study.  The  results  are 
graphically  summarized  in  Figure  9.  The 
data  indicate  that  an  increase  in  viewing 
time  results  in  increased  performance  for 
the  targets  located  near  a  prominent 
terrain  feature.  For  the  other  targets, 
image  motion  rate  had  no  effect  on  per¬ 
formance,  and  performance  was  generally 
below  10  percent.  The  prominent  terrain 
feature  apparently  served  as  a  context  cue 
to  the  target's  location.  The  improvement 
in  performance  for  targets  with  natural 
cues  lends  credence  to  the  concept  of  pro¬ 
viding  artificial  cues  to  improve  perfor¬ 
mance.  Performance  data  for  the  ten  most 
difficult  targets  from  the  first  experiment 
is  also  plotted  in  Figure  9.  This  gives  a 
comparison  of  performance  when  target  keys 
are  available  during  a  mission.  A  compari¬ 
son  of  the  two  curves  indicates  that  the 
presence  of  target  keys  yields  performance 
similar  to  that  obtained  when  the  target 
is  near  a  major  context  one. 

Analysis  of  these  findings  with 
respect  to  system  utilization  indicate  that 
aircraft  speeds  yielding  more  than  3  sec¬ 
onds  on  display  are  required  for  good  per¬ 
formance.  The  system  does  not  appear  to  be 
adequate  for  small  targets  or  targets  of 
opportunity,  although  if  context  cues  or 
target  keys  can  be  provided  performance 
will  improve. 
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STABILIZED  IMAGE  DISPLAYS 

A  narrow  field  of  view  (FOV)  image 
stabilized  electro-optical  sensor  has  also 
been  suggested  as  a  means  of  extending 
man's  visual  performance  envelope.  This 
sensor  will  theoretically  allow  acquisition 
of  targets  at  extreme  standoff  ranges,  10- 
20  miles.  In  operation,  the  sensor  system 
will  be  turned  on  at  extreme  range  and 
aimed  through  the  use  of  the  aircraft's 
navigational  system.  Theoretically  the 
range  of  this  system  is  limited  by  atmo¬ 
spheric  effects  and  the  relationship 
between  aircraft  altitude  and  the  earth's 
curvature.  Practically,  however,  the 
effective  range  of  the  system  will  be 
determined  by  the  systems  operator's  abil¬ 
ity  to  locate  and  recognize  the  target  on 
the  display.  This  ability  is  a  complex 
function  of  a  large  number  of  variables 
derived  from  display,  sensor,  and  observer 
centered  parameters. 

A  number  of  studies  were  performed  to 
determine  the  systems  operator  constraints 
on  the  utilization  of  the  system,  and  where 
possible  to  develop  techniques  and  proce¬ 
dures  which  will  extend  its  operational 
utility.  Our  intial  efforts  were  directed 
toward  establishing  a  performance  baseline 
and  defining  the  boundary  conditions  for 
target  location  and  acquisition.  To  imple¬ 
ment  this  program,  a  simulation  of  a  long 
focal  length  airborne  TV  system  was  con¬ 
figured  with  the  McDonnell  Reconnaissance 
Laboratory's  Displays  Mockup  Facility. 

The  sensor  was  simulated  with  a  70  mm  Slide 
projector,  rear  projection  screen,  and  a 
closed  circuit  television  (CCTV)  camera 
system  with  a  remotely  controlled  zoom 
lens.  The  equipment  set  up  is  shown  in 
Figure  10.  The  input  scene  was  projected 
onto  the  rear  projection  screen  and  imaged 
by  the  zoom  lens  into  the  TV  camera.  By 
controlling  the  rate-of-zoom  lens'  set¬ 
ting,  a  simulation  was  made  of  a  fixed 
focal  length  system  in  a  moving  aircraft 
at  each  of  several  speeds .  The  rate  of 
zoom  was  controlled  with  a  position  servo 
system  that  tracked  an  input  signal  pro¬ 
portional  to  the  zoom  factor  as  a  function 
of  time.  The  optical  axis  of  the  lens  was 
held  fixed  with  respect  to  the  projected 
scene,  simulating  the  effect  obtained  with 
a  ground  stabilized  sensor  guidance  sys¬ 
tem.  The  subjects  viewed  the  display  on 
a  9  inch,  525  line,  Conrac  TV  monitor 
mounted  in  the  cockpit  mockup  (see  Fig¬ 
ure  7) . 
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Figure  10.  Stabilized  Image 
System  Simulation 


Forward  oblique  imagery  taken  from 
the  Reconnaissance  Data  Base  was  used  to 
construct  70  mm  (2-1/4  x  2-1/4  inch  image) 
slides  of  18  typical  targets.  Using  the 
estimated  resolution  losses  of  the  photo¬ 
copying  processes,  the  optical  projection 
system  and  an  assumed  minimal  resolution 
of  20  lp/mm  in  the  data  base  imagery,  it 
was  concluded  that  the  limiting  factor  of 
the  display  image  quality  was  the  CCTV 
system,  just  as  it  would  be  for  most  real- 
world  situations.  The  targets  included 
buildings,  POL  storage  tanks,  water 
towers,  and  electric  power  substations. 
Image  scales  were  chosen  to  produce  a  con¬ 
stant  1/4  inch  target  size  on  the  cockpit 
display  at  the  onset  of  each  trial.  With 
a  constant  displayed  target  size,  we 
defined  a  baseline  which  can  be  used  with 
the  true  target  size,  to  calculate  an 
acquisition  range  for  other  size  targets. 

The  output  of  the  zoom  system  was 
presented  on  a  5.5  inch  square  area  marked 
off  on  the  display  monitor.  At  the  start 
of  a  trial,  the  projected  image,  at  mini¬ 
mum  scale,  to  represent  the  sensor  cover¬ 
age  at  long  range,  was  presented  on  the 
cockpit  display  and  the  observer  began  his 
search  for  the  target.  As  the  trial 
progressed,  the  CCTV  system  zoomed  in  on 
the  center  of  the  image  with  the  rate  of 


zoom  appropriate  to  the  simulated  speed. 
This  system  simulated  a  narrow  (approxi¬ 
mately  one  to  two  degree)  FOV  sensor  at  20 
miles  initial  range  and  a  20,000  feet 
altitude.  The  targets  were  acquired  by 
slewing  a  cursor  to  a  position  over  the 
target  image  and  pressing  a  fire  button  on 
the  cursor  control  joystick.  Target  keys 
were  available  for  reference  at  all  times 
during  the  trial . 

The  controlled  variables  were  air¬ 
craft  speed,  target  location  in  the  FOV, 
and  type  of  key.  Aircraft  speeds  of  360, 
560,  900  and  1200  knots  were  simulated. 

When  the  target  was  in  the  center  of  the 
display,  zooming  in  enlarged  the  displayed 
target.  When  the  target  was  offset  from 
the  center,  zooming  in  increased  the  off¬ 
set  as  well  as  the  displayed  target  size. 
For  sufficiently  large  offsets  the  target 
went  off  the  display  (out  of  the  sensor 
FOV)  as  the  lens  zooms  in  on  the  center  of 
the  image.  Three  concentric  bands  were 
used  to  identify  target  location  in  the 
FOV.  The  bands  were  from  center  to  1/3  the 
distance  to  the  edge  of  the  image,  from  1/3 
to  2/3  the  distance  to  the  edge  of  the 
image,  and  from  2/3  to  the  edge  of  the 
image.  Each  band  contained  six  targets  at 
random  azimuths. 

Target  briefing  cues,  target  keys, 
were  also  varied.  One  group  of  12  subjects 
received  a  15X  enlargement  of  the  target 
and  immediate  surrounding  area  printed  on 
a  5  x  7  inch  glossy  photograph.  Another 
group  of  8  subjects  received  a  line  sketch 
of  the  target  area  traced  from  the  5x7 
photo  (Figure  11).  The  line  sketch  had 
greater  than  100:1  data  compression  when 
compared  with  the  photo.  This  is  signifi¬ 
cant  when  transmitting  images  over  an 
electronic  data  link  system. 


Figure  11.  Picture  and  Sketch  Target 
Acquisition  Aids 
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Time  to  acquire  the  target  and  target 
acquisition  accuracy  were  measured.  The 
data  were  plotted  as  a  function  of  aircraft 
speed  with  the  effects  of  offset  and  target 
keys  as  parameters  and  an  analysis  of  vari¬ 
ance  was  performed.  The  data  indicate  that 
accurate  target  acquisition  can  be  obtained 
if  the  target  is  in  the  center  2/3  of  the 
FOV  (Figure  12).  Time  to  acquire  the  tar¬ 
get  did  not  vary  with  offset  from  the  cen¬ 
ter  of  the  image.  All  offset  positions 
showed  an  approximately  linear  drop  in 
acquisition  time  of  about  15  seconds  as 
speed  varied  from  360  to  1200  knots.  This 
lack  of  position  effect  on- time-to-acqui re 
the  target  is  not  surprising  as  the  targets 
were  originally  screened  by  offset  groups 
to  select  six  targets  having  a  minimum 
spread  of  acquisition  times.  Photo  keys 
were  superior  to  line  sketched  keys  in  both 
acquisition  time  and  percent  of  correct 
acquisitions  (Figure  13).  Overall  a  10 
percent  reduction  in  performance  and  a  7 
second  increase  in  time  were  found  using 
the  line  sketch  keys.  An  evaluation  of 
the  offset  by  briefing  aids  interaction, 
however,  indicated  that  while  there  were  no 
significant  time  effects,  there  was  a  sig¬ 
nificant  performance  interaction.  The 
effects  of  the  sketch  versus  the  photo 
briefing  aid  were  not  significant  for  the 
center  2/3  of  the  image,  thus  if  the  navi¬ 
gation  accuracy  can  put  the  target  in  the 
center  2/3  of  the  Image,  the  sketch  aid  can 
be  utilized  with  little  or  no  penalty  in 
percent  correct  acquisition. 
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Figure  13.  Study  One  Effects  of 
Briefing  Aids 


The  general  effect  of  aircraft  speed 
was  a  15  second  drop  in  acquisition  time 
and  a  10  percent  drop  in  performance  over 
the  speed  range  of  360  to  1200  knots.  The 
maintenance  of  a  good  degree  of  accuracy 
at  the  1200  knot  speed  Indicates  a  great 
potential  for  this  sensor  system  in  high 
performance  aircraft. 

A  second  study  was  conducted  to  eval¬ 
uate  the  effects  of  changes  in  FOV  on  per¬ 
formance.  This  study  used  the  same  proce¬ 
dure  and  photo  briefing  aids  as  the  first 
study.  The  variables  manipulated  were 
closing  rate,  navigation  accuracy  and  FOV. 
Closing  rates  of  360,  560,  900  and  1200 
knots  were  simulated  by  automatically  pre¬ 
setting  the  zoom  rate  on  the  CCTV  system. 
Fields  of  view  of  1.30  and  .90  were  used, 
the  latter  resulting  in  an  increase  in  the 
scale  of  the  image  and  a  decrease  in  the 
ground  coverage.  This  was  simulated  by 
starting  the  zoom  procedure  at  different 
magnifications.  The  change  In  starting 
point  resulted  in  image  viewing  times  of 
45,  58,  98,  and  158  seconds  for  the  1.3° 
FOV  and  30,  39,  65,  and  106  seconds  for 
the  .9°  FOV.  In  all  cases  the  time-on- 
display  for  the  narrow  FOV  exceeded  the 
average  time  for  acquisition  for  the  wide 
FOV,  insuring  that  the  narrow  FOV  did  not 
excessively  distort  performance  due  to 
differences  in  presentation  time. 

Performance  was  measured  with  respect 
to  time  to  acquire  the  target  and  target 
acquisition  accuracy.  This  data  was  plot¬ 
ted  with  respect  to  the  effects  of  FOV  as 
a  function  of  aircraft  speed  and  appears 
in  Figure  14. 
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Figure  14.  Study  Two  Effects  of  FOV 
and  Closing  Rate 


The  area  covered  by  the  narrow  FOV 
contained  only  those  targets  in  the  center 
2/3  of  the  wide  FOV  scene,  however,  it  dis¬ 
tributed  their  locations  over  the  entire 
image.  Comparisons  were  therefore  made  of 
both  a  truncated  12  target  sample  (those 
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targets  in  the  center  2/3  of  the  image) 
and  the  complete  18  target  sample  (Figure 
14A).  The  number  correct  for  those  tar¬ 
gets  which  appeared  in  both  FOVs  varied  as 
a  function  FOV,  with  the  wide  1.3°  FOV 
yielding  better  performance.  For  this 
case,  the  targets  all  appeared  in  the 
central  2/3  of  the  display  and  as  such  had 
long  display  times  and  a  favored  search 
position,  both  of  which  typically  increase 
performance.  If  the  narrow  FOV  data  is 
compared  with  the  full  compliment  of  wide 
FOV  targets,  so  the  target  distribution 
over  the  image  is  the  same  (Figure  14A), 
the  overall  difference  between  the  two 
FOVs  is  not  significant.  An  analysis  of 
the  time  to  acquire  the  target  indicates 
that  faster  acquisition  times  are  found 
with  the  narrow  FOV  (Figure  14B).  This  is 
most  probably  due  to  the  increased  scale 
associated  with  this  FOV. 

Overall,  relating  the  findings  of  both 
studies  to  system  design,  a  sensor  pointing 
accuracy  sufficient  to  place  the  target  in 
the  center  2/3  of  the  display  will  give  the 
best  acquisition  performance.  Changes  in 
FOV  can  be  accounted  for  by  changes  in 
placement  in  the  scene  and  increases  in 
scale.  Since  this  is  a  zoom  system  small 
scale  targets  will  be  seen  at  larger  and 
larger  scales  as  the  aircraft  closes  with 
the  target.  This  factor  washes  out  the 
effects  of  initial  target  size,  provided 
the  target  remains  on  the  display  as  the 
system  zooms  in  on  it.  Placement  in  the 
center  2/3  of  the  image  ensures  both 
adequate  scale  and  search  time  during  the 
simulated  approaches.  Additional  work  is 
needed  to  define  precise  limits  of  this 
relationship  with  respect  to  initial  scale 
and  other  zoom  rates. 

Target  acquisition  from  aircraft  with 
speeds  as  high  as  1200  knots  are  possible 
utilizing  this  system.  However,  at  these 
higher  speeds  the  available  time  approaches 
the  threshold  of  required  time  for  critical 
combinations  of  actual  target  size  and 
sensor's  angular  resolution.  Converting 
the  findings  into  an  operator  envelope  and 
comparing  it  with  the  visual  envelope 
(Figure  15),  we  find  a  potential  doubling 
of  the  usable  portion  of  the  aircraft 
envelope. 

The  line  sketch  target  key  offers  a 
100:1  data  compression  advantage  at  a  cost 
of  only  10  seconds  time  and  10%  performance 
degradation.  If  the  navigation  accuracy 


requirements  are  met  and  the  target  is  in 
the  center  2/3  of  the  display  the  sketch 
can  be  used  at  no  cost  in  performance 
degradation.  The  narrow  bandwidth  required 
to  transmit  the  line  sketch  key  gives  this 
technique  a  special  potential  for  use  in  a 
hunter  killer  team  to  destroy  fleeting  tar¬ 
get. 


AIRCRAFT-LEVEL  FLIGHT 


Figure  15.  Sensors  Can  Double  Usable 
Flight  Envelope 
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Abstract:  A  representative  example  is  cited 
of  the  coordinated  use  of  complementary 
research  methods  to  systematically  improve 
quantitative  prediction  and  operational  per¬ 
formance  in  target  acquisition.  A  modeling 
concept  is  described  that  provides  for  the 
organized  treatment  of  problem  parameters, 
an  evolutionary  data  management  system, 
and  the  identification  of  leveraged  parameters 
offering  significant  improvement  in  prediction 
and/ or  performance.  Illustrative  applications 
of  analysis,  simulation,  laboratory  testing, 
and  field  test  validation  are  then  used  as 
examples  of  how  prediction  capabilities  and 
system  performance  can  be  improved  relative 
to  the  parametric  implications  of  existing 
models.  The  examples  used  are  scaling  and 
classification  of  target  backgrounds,  and 
head -up  target  predesignation  using  onboard 
navigation  system  information. 

INTRODUCTION 

The  performance  of  current  operational 
systems  is  severely  restricted  by  a  limited 
ability  to  acquire  and  identify  surface  targets 
in  time  to  permit  the  effective  delivery  of 
ordnance.  The  visual  search  requirements 
imposed  on  the  pilot,  who  must  collect  and 
process  large  quantities  of  visual  and/or 
sensor-mediated  information  in  real  time* 
typically  result  in  target  acquisition  perfor¬ 
mance  levels  substantially  below  those  of 
which  the  system  is  theoretically  capable. 
Technological  solutions  to  this  problem  require 
an  improved  parametric  understanding  of 
the  operational,  environmental,  design,  and 
crew  performance  factors  affecting  system 
performance,  the  identification  of  high-lever- 
aged  system  variables  offering  significant 
improvement  potential,  and  the  systematic 


evaluation  of  performance  improvement 
concepts  under  real-time  man -in -the - 
loop  conditions  characteristic  of  actual 
mission  situations. 

Progress  towards  improved  pre¬ 
diction  and  performance  in  the  target 
acquisition  task  can  be  greatly  facilitated 
by  the  coordinated  use  of  several  comple 
mentary  research  methods.  Parametric 
modeling  serves  as  a  useful  tool  for 
organizing  problem  variables  and  quali¬ 
fied  data.  It  also  focuses  attention  on 
quantitative  data  deficiencies,  and  tends 
to  indicate  the  areas  of  research  that 
potentially  could  lead  to  significant  per¬ 
formance  improvements.  Techniques  of 
analysis,  laboratory  testing,  and  simu¬ 
lation  can  then  be  applied  as  appropriate, 
singly  or  in  combination,  to  satisfy  high- 
leveraged  data  requirements  or  to  test 
a  given  improvement  concept.  Finally, 
controlled  field  tests  can  be  used  to 
verify  and/or  calibrate  selected  points 
in  the  overall  performance  matrix.  It 
is  probable  that  each  of  these  methods, 
when  combined  in  a  systematic  and 
coordinated  fashion,  contribute  more  to 
an  improved  knowledge  of  target  acqui¬ 
sition  than  any  single  method  used 
independently.  This  is  especially  true 
where  an  economical  expenditure  of 
resources  is  a  prime  constraint. 

A  MODELING  FRAMEWORK 

While  a  detailed  model  description 
is  beyond  the  scope  of  this  paper,  a 
general  explanation  of  the  modeling 
scheme  in  use  will  assist  in  showing  the 
inter-relationship  between  it  and  other 


179 


research  and  analysis  efforts.  This  modeling 
scheme  was  initially  developed  for  application 
to  the  visual  target  acquisition  problem 
(Gilmour  and  Emerson,  1965),  and  has  since 
been  used  as  a  general  format  for  detailed 
parametric  modeling  of  both  the  day -visual 
and  night/ all -weather  tasks.  Although  the 
basic  framework  remains  the  same,  models 
dealing  with  radar,  TV/LLLTV,  and  IR/FLIR 
additionally  interpose  the  appropriate  sensor/ 
processor/ display  variables  between  the 
operator  and  his  environment.  Only  the 
generalized  format  will  be  considered  here. 

To  be  widely  useful,  it  is  desirable  that 
an  acquisition  model  organize  the  problem 
parameters  in  relation  to  operational  perfor¬ 
mance  criteria  that  have  a  real  world  counter¬ 
part.  In  this  way,  the  objectives  of  the  model 
can  be  oriented  toward  defining  the  functional 
relationships  between  operational,  environ¬ 
mental,  design,  and  aircrew  variables  and 
the  mission  performance  of  the  system.  The 
ultimate  criteria  in  this  case  is  acquisition 
performance,  but  several  intermediate 
criteria  can  also  be  established  which  provide 
for  the  systematic  and  logical  treatment  of 
parameters  and  which  give  recognition  to  the 
highly  sequential  nature  of  the  acquisition 
task. 

In  the  present  case,  the  operational  per¬ 
formance  criteria  of  the  model  were  defined 
in  terms  of  a  listing  of  sequential  require¬ 
ments  for  successful  target  acquisition  per¬ 
formance.  These  requirements  included  four 
major  elements:  (1)  Geometric  Intervisibility  - 
providing  geometric  line -of -sight  and  com¬ 
patibility  between  available  field -of-view  and 
the  ground  area  containing  the  target,  (2) 
Visual  Target  Availability  -  providing  a 
supraliminal  presentation  of  the  target  element 
to  the  observer,  (3)  Real-Time  Search  - 
successful  visual  sampling  of  the  target 
element  from  all  of  the  dynamic  scene  ele¬ 
ments  present,  and  (4)  Discrimination  and 
Decision  -  requiring  detail  discrimination  to 
a  level  necessary  to  satisfy  the  observer's 
pre-stored  "target"  definition. 

It  can  be  noted  that  these  requirements 
are  cumulative  and  sequentially -dependent 
with  respect  to  successful  acquisition  of  an 
assigned  target.  It  is  also  evident  that 
certain  groups  of  variables  will  be  operative 
in  determining  whether  each  particular 
sequential  requirement  is  met.  (However, 
categories  of  variables  are  not  necessarily 


exclusive  to  a  single  requirement,  since 
some  variables  can  be  seen  to  operate  on 
different  requirements  in  more  than  one 
way.)  This  permits  the  operational 
performance  criteria  of  the  acquisition 
model  to  be  specified  in  terms  of  the 
particular  group  of  contributing  parameters, 
A  general  schematic  of  the  resulting 
model  is  illustrated  in  Figure  1,  showing 
three  intermediate  performance  criteria 
as  exposure,  detection  and  identification 
submodels.  (These  terms  are  used 
largely  for  convenience  in  notation;  the 
reader  is  invited  to  substitute  other 
labels  if  desired.)  The  exposure  and 
detection  submodels  correspond  to  the 
first  two  requirements  listed  above,  while 
the  latter  two  requirements  have  been 
combined  in  the  identification  submodel. 

Each  submodel  addresses  a  specific  question 
relative  to  the  sequential  requirements  for 
successful  acquisition.  Each  also  treats 
a  specific  group  of  variables,  examples 
of  which  are  listed  in  Figure  1. 
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Figure  1.  Target  Acquisition 
Prediction  Model 


The  sequential  nature  of  the  acqui¬ 
sition  task  is  further  illustrated  by  placing 
the  above  model  on  a  range/ time  continuum. 
This  is  shown  in  Figure  2,  where  target 
range  and  time-to-go  decrease  progressively 
to  the  right.  In  the  hypothetical  example 
shown,  the  upper  limits  of  acquisition  per¬ 
formance  are  constrained  by  target  availa¬ 
bility  (the  combination  of  exposure  and  de- 
taction  criteria).  Actual  performance 
(identification),  however,  characteristically 
occurs  much  later  than  this  upper  limit. 
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E+D-TARGET  AVAILABILITY 


E+D+l  TARGET  ACQUISITION 


Figure  2.  Acquisition  Model  Range/Time 
Sequence 

Prediction  of  the  visual  search  and 
discrimination  performance  that  accounts 
for  this  difference  has  never  been  particu¬ 
larly  successful,  largely  due  to  our  in¬ 
ability  to  deal  quantitatively  with  the  vari¬ 
ables  operating  in  the  identification  sub¬ 
model*  This  has  been  particularly  unfor¬ 
tunate  both  because  the  unpredicted  per¬ 
formance  variations  in  this  region  are 
typically  very  substantial,  and  also  because 
a  large  number  of  delayed  or  unsuccessful 
acquisitions  occur  that  cannot  be  systemat¬ 
ically  explained  or  corrected.  It  may,  on 
occasion,  be  worthwhile  to  consider  the 
upper  limits  of  performance  attainable  in 
a  system,  but  where  the  system  seldom  if 
ever  performs  this  way  operationally,  it  is 
a  mistake  to  take  this  as  a  routine  ex¬ 
pectation. 

TA  RGET/BA  CKGROUND 
SCALING  AND  CLASSIFICATION 

One  parameter  contributing  to  the 
identification  submodel  analysis  has  been 
particularly  troublesome.  This  factor  has 
to  do  with  the  embeddedness  (or,  conversely, 
conspicuity)  of  a  particular  target  in  a  par¬ 
ticular  background  or  setting.  Both  simu¬ 
lation  and  field  test  data  consistently  show 
extremely  wide  performance  variations  that 
are  associated  with  so-called  target  dif¬ 
ferences.  Such  variations  may  often  comprise 
70-80%  of  the  total  test  variability.  An  ex¬ 
ample  of  this  is  shown  in  Figure  3,  taken 
from  the  simulator  validation  test  portion  of 
a  large-scale  field  test. 

The  performance  variations  represented 
are,  of  course,  not  strictly  a  matter  of  dif¬ 
ferences  between  the  target  items  themselves. 


Rather,  they  represent  the  total  problem  of 
visual  search  in  a  dynamic  and  complex  vis¬ 
ual  environment,  and  hence  include  factors 
related  to  targets,  backgrounds,  and  the 
interactions  between  the  two.  The  central 
question  in  visual  search  is  the  time  (and 
range)  required  by  the  observer  to  visually 
sample  the  desired  target  item  from  the 
barrage  of  visual  information  available.  This 
will  be  determined  largely  by  the  character¬ 
istics  of  the  background  or  target  setting 
(terrain,  cultural  development,  contextual 
cues,  etc.)  and  by  the  target’s  relationship 
to  these  characteristics. 


Figure  3.  Cumulative  Acquisition 
Probability  as  a  Function  of 
Range  for  10  Prebriefed  Targets 

This  is  illustrated  in  the  acquisition 
performance  data  shown  in  Figure  4.  Per¬ 
formance  is  seen  to  differ  widely  on  two 
similar  f bridge”  targets  located  in  different 
settings.  Cultural  development  and  terrain 
patterns  differed  considerably  in  the  two 
cases. 


Figure  4.  Acquisition  Curves  for  Two 
Bridges  -  Different  Backgrounds 
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As  a  further  example  of  the  effects  of 
visual  search  requirements  on  acquisition 
performance,  Figure  5  shows  performance 
in  a  typical  target/background  encounter 
plotted  as  a  percentage  of  maximum  avail¬ 
able  range  of  the  target.  The  use  of  approxi¬ 
mately  60%  of  the  available  range/time  for 
search,  while  reasonably  characteristic, 
varies  substantially  under  different  target/ 
background  conditions. 


Figure  5.  Cumulative  Probability 
Curve  Fit  to  a  Set  of  JTF-2 
Test  Data  (Target  W-4) 

In  a  study  recently  completed  for  the 
Office  of  Naval  Research  (Zaitzeff,  1971), 
initial  work  was  begun  on  a  technique  to 
predict  target/background  performance 
effects.  This  study  made  use  of  160* 

70mm  color  motion  picture  and  reconnais¬ 
sance  imagery  from  JTF-2  target  acqui¬ 
sition  field  test  courses,  and  of  dynamic 
simulator  acquisition  performance  data 
which  had  previously  been  validated  against 
large-scale  military  field  test  results 
(Wyman,  Gilmour,  and  Snyder,  1968).  For 
purposes  of  this  study,  static  scenes 
were  extracted  from  the  cinematic  imag¬ 
ery  at  10  discrete  ranges  relative  to  each 
of  10  targets.  Acquisition  performance 
data  consisted  of  encounters  of  each  of 
the  10  targets  by  a  total  of  160  operation- 
ally-ready  military  aircrews. 


The  approach  followed  was  to  in¬ 
vestigate  a  number  of  measurable  prop¬ 
erties  of  each  of  the  scenes  representing 
different  stages  of  the  target  encounters. 
Analyses  were  then  performed  to  evaluate 
the  independence,  quantitative  effect,  and 
relative  predicting  power  of  these  variables 
relative  co  the  criterion  performance. 
Finally,  a  regression  equation  was  developed 
that  offered  significant  prediction  capa¬ 
bilities,  economy,  and  reasonable  prospects 
for  stability  when  generalized  to  new  target/ 
background  situations. 


The  15  candidate  classification  para¬ 
meters  investigated  are  listed  in  Figure  6 
according  to  the  measurement  techniques 
used.  They  are  largely  self-explanatory 
except  for  the  subjective  scaling  measures. 
Separate  studies  using  16  subjects  each 
were  conducted  to  evaluate  the  number  of 
potential  confusion  (target-like)  elements 
in  each  progressive  scene  (ambiguity), 
the  percentage  of  observers  designating 
the  target  with  a  high  confidence  level  for 
a  given  scene  (static  acquisition),  and  the 
degree  to  which  individual  scenes  were 
judged  to  be  visually  nonuniform  on  a  ten- 
point  scale  (heterogeniety).  In  the  last  in¬ 
stance  no  target  was  specified;  in  the  other 
studies  pretest  briefing  materials  were 
limited  to  the  target  item  and  its  immediate 
surround. 
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The  purpose  of  this  study  was  to 
begin  development  of  a  target/background 
scaling  and  classification  metric  that 
could  be  used  to  predict  visual  search  per¬ 
formance  over  a  wide  range  of  target/ 
background  conditions.  The  initial  ob¬ 
jective  was  a  preliminary  algorithm  re¬ 
lating  measured  characteristics  of  target/ 
background  scenes  to  the  actual  visual 
search  performance  of  aircrews  when 
acquiring  these  targets  under  real-time 
conditions. 


Figure  6.  Candidate  Target/Background 
Classification  Parameters 

Analyses  performed  to  compare  meas¬ 
ured  target/background  characteristics  with 
the  criterion  dynamic  acquisition  performance 
are  summarized  in  Figure  7.  Factor  analysis 
was  used  primarily  as  an  investigative  tool 
to  examine  the  correlations  among  the 
original  set  of  independent  variables  and  the 
criterion.  Moderately  high  correlations  were 


observed  in  most  cases,  and  were  generally 
in  the  expected  direction.  A  rotated  factor 
matrix,  however,  revealed  a  total  of  7  ortho¬ 
gonal  factors  with  a  distribution  of  loadings 
that  suggested  a  moderate  amount  of  inter¬ 
dependence  among  the  set  of  variables  and 
the  existence  of  a  number  of  redundancies. 
This  in  turn  suggested  that  almost  as  much 
predictive  power  could  be  obtained  from  a 
considerably  reduced  set  of  variables. 


Figure  7.  Target /Background  Analyses 


Linear  stepwise  regression  was  used 
to  determine  the  quantitative  contributions 
of  the  original  variables  to  the  prediction  of 
acquisition  performance  on  the  targets  used 
in  this  study.  A  regression  equation  using  all 
15  variables  was  seen  to  account  for  93%  of 
the  criterion  variance,  while  an  equation  using 
a  reduced  set  of  7  of  these  variables  was 
still  able  to  account  for  90%  of  this  variance. 
Due  to  the  overlap  and  interdependence  be¬ 
tween  the  variables,  however,  it  was  difficult 
to  evaluate  their  true  relative  predictive 
power.  Direct  generalization  of  results  to  a 
wide  range  of  target/ background  conditions 
would  thus  involve  the  assumption  of  a  nearly 
identical  dependence  between  the  variables 
whenever  the  technique  was  applied. 

As  a  result,  ridge  regression  (Hoerl  and 
Kinnard,  1970)  was  applied  to  obtain  the  best 
estimate  of  the  individual  sensitivities  and 
coefficients  of  the  original  independent  vari¬ 
ables.  Using  this  approach,  the  solution 
using  15  variables  was  able  to  account  for 
82%  of  the  criterion  variance.  A  reduced  set 
of  7  variables  predicted  79%  of  the  criterion 
variance.  The  reduced  variable  set  included 
measured  target  width,  target  contrast, 
target  detail  contrast,  calculated  length, 
ambiguity,  heterogeniety,  and  total  brightness 
elements. 


A  representative  application  of  the 
regression  equation  using  the  reduced  set 
of  seven  target/background  variables  is 
illustrated  in  Figure  8,  showing  a  com¬ 
parison  of  predicted  values  with  a  least- 
squares  curve  fit  to  the  actual  dynamic 
acquisition  performance  data.  For  the 
particular  target/background  case  shown, 
the  maximum  available  range  of  the  target 
was  approximately  36,  500  feet.  The  de¬ 
crement  between  this  range  and  the  actual 
acquisition  performance  curve  (.  5P  = 
17,500  ft.)  had  previously  represented  an 
area  of  largely  unpredicted  test  vari¬ 
ability. 
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Figure  8.  Target  Acquisition  Model 
Prediction  of  Large-Scale  Field 
Test  Data 

Continuing  target/background 
classification  studies  are  expected  to 
refine  the  initial  scaling  technique, 
and  to  verify  its  application  to  an  expanded 
range  of  targets  and  terrain.  Questions 
also  exist  as  to  the  extraction  of  pre¬ 
dictive  information  from  different  types  of 
premission  intelligence/reconnaissance 
data,  as  well  as  the  application  of  pre¬ 
diction  techniques  to  various  types  of 
aided -visual  and  non -visual  sensor/display 
modes.  In  addition  to  improved  analytical 
prediction  capabilities,  however,  near-term 
emphasis  will  be  continued  on  the  identi¬ 
fication  of  a  meaningful  and  economical 
set  of  target/background  scaling  variables 
that  directly  impact  the  decisions  of  the 
system  engineer  and  operational  field  com¬ 
mander. 

CUEING  AND  PREDESIGNATION  AS 
SEARCH  AIDS 

Capabilities  to  quantitatively  evalu¬ 
ate  and  predict  visual  search  performance 
In  a  complex  and  dynamic  environment 
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open  the  way  for  the  systematic  develop¬ 
ment  of  performance  aid  techniques  that 
can  best  assist  the  operator  with  this  part 
of  the  acquisition  task.  In  addition, 
methods  for  objective  scaling  and  classi¬ 
fication  of  target/background  conditions 
will  permit  the  field  commander  and  air¬ 
crew  to  select  those  operating  forces, 
flight  conditions,  tactics,  sensor  modes, 
and  weaponry  most  likely  to  result  in  a 
successful  mission  outcome. 

As  previously  shown  in  Figure  5, 
targets  are  rarely  acquired  at  a  range 
and  time  approximating  their  first  availa¬ 
bility  to  the  operator.  Rather,  target/ 
background  conditions  more  typically 
result  in  search  requirements  that  impose 
severe  range  penalties.  Not  infrequently 
these  constraints  mean  that  the  target  is 
never  acquired  at  all,  even  though  it  has 
been  visually  available  for  a  considerable 
time  and  range.  Such  range/time  penalties 
translate  themselves  into  lowered  accuracies 
during  rapid  target  designation,  reduced 
probabilities  of  successful  conversion  and 
weapon  delivery,  and  other  decrements  that 
lessen  the  chances  of  a  successful  first- 
pass  target  encounter.  A  critical  need  thus 
exists  to  develop  performance  aids  that 
significantly  assist  the  operator  with  the 
visual  search  problems  imposed  by  target/ 
background  conditions. 

Performance  aiding  concepts  vary  over 
a  wide  range,  both  as  to  the  basic  approach 
and  the  manner  in  which  they  are  employed 
and  mechanized.  In  general,  most  involve 
providing  the  operator  with  new  items  of 
collateral  information  designed  to  speed  the 
search  task,  or  presenting  existing  infor¬ 
mation  in  a  more  timely  and  readily  usable 
form.  Most  have  the  objective  of  reducing 
the  amount  of  visual  sampling  and  processing 
that  must  be  performed  by  the  operator  in 
real-time,  and  earlier  tests  have  shown 
that  the  most  successful  techniques  for 
accomplishing  this  will  often  be  heavily  de¬ 
pendent  on  the  particular  mission  situation, 
flight  conditions,  and  environment,  as  well 
as  the  availability  of  collateral  data  from 
either  on-board  or  cooperative  air/ground 
sources. 

One  of  the  most  elementary  forms  of 
aiding  may  be  considered  to  be  the  develop¬ 
ment  of  premission  reconnaissance/ intelli¬ 
gence  data  that  are  closely  tailored  to  the 
actual  conditions  under  which  the  target  will 
be  encountered.  Recent  tests  indicated  that, 


for  certain  target/background  and  mission 
conditions,  such  data  can  result  in  a  2  to  1 
improvement  in  acquisition  performance 
(Jahns  and  Gilmour,  1968). 

Further  improvements  have  been  indi¬ 
cated  when  such  data  are  presented  on  a 
timely  inflight  basis.  Examples,  showing  a 
progressively  more  useful  format,  include 
digital  display  of  target/ checkpoint  coordinates, 
direct  readout  of  azimuth  and  time-to-go 
information,  and  sequential  pictorial  display 
of  checkpoints  and  target  lead-in  cues  that 
can  be  directly  correlated  with  the  operator's 
real  time  visual  scene  or  display. 

One  extension  of  these  concepts  has 
recently  been  tested  in  detail,  and  serves 
to  illustrate  the  improvement  potentials 
available  (Leininger  and  Logan,  1971). 

In  this  concept,  predicted  target/ check¬ 
point  locations  were  predesignated  to  the 
aircrew  via  a  target  window  in  a  head- 
up  display  based  on  onboard  navigation 
system  information.  Several  levels  of 
navigational  precision,  and  predesignation 
accuracy,  were  evaluated.  Testing  con¬ 
sisted  of  600  encounters,  involving  20 
tactical  targets,  using  wide-screen  70mm 
color  motion  picture  simulation  techniques. 

Overall  results  are  summarized 
in  Figures  9-11.  Figure  9  indicates  the 
performance  of  all  predesignation  encoun¬ 
ters  relative  to  previously-established  un¬ 
aided  performance  levels.  The  upper 
limits  of  performance  are  also  indicated 
by  the  target  availability  curve  (theoretical 
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Figure  9.  Head-Up  Target 
Predesignation 
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maximum  performance  by  all  aircrews  on 
all  target  encounters).  Performance  im¬ 
provements  in  acquiring  individual  targets 
using  predesignation  frequently  permitted 
successful  target  conversion  and  weapon 
delivery  that  would  not  otherwise  have 
been  possible. 


Targets  Where  Control  Group  Performance 
is  Less  Than  t.O 

Nteifnitki  Mmimiki  InprmfnMt 


Figure  10.  Probability  of  Acquisition 
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Figure  11.  Acquisition  Range 
Improvement  Obtained  with 
Predesignation 

Figures  10  and  11  show  some  of 
the  individual  target/ background  effects, 
and  the  extent  to  which  the  predesignation 
technique  was  successful  in  reducing  the 
operator's  search  task.  Predesignation 
was  observed  to  be  highly  effective  in 
reducing  missed  targets,  as  reflected  in 
overall  acquisition  probabilities.  Acqui¬ 
sition  range  improvements  were  also 
significant,  and  quite  target  specific, 


with  certain  target/background  situations 
benefiting  considerably  more  from  the 
use  of  the  predesignation  technique. 

Further  extensions  of  this  aiding 
concept  are  currently  being  developed 
and  evaluated,  including  the  coordinated 
use  of  inertial  and  radar-based  pre¬ 
designation  in  combination  with  range¬ 
extending  concepts  such  as  TISEO.  Con¬ 
sideration  is  also  being  given  to  acquisition 
aiding  techniques  which  provide  a  smooth 
interface  and  transition  into  fire  control 
and  delivery  methods  using  both  conventional 
and  standoff  weapons. 

A  wide  range  of  aiding  concepts 
for  both  day -visual  and  night/ all -weather 
applications  remain  to  be  evaluated.  These 
include  the  selective  filtering  of  meaningful 
information  from  different  sensor  sources, 
each  of  which  may  be  capable  of  collecting 
much  more  data  than  the  operator  can 
process,  and  the  systematic  use  of  cueing 
data  to  enhance  performance  with  the  most 
appropriate  primary  acquisition  sensor.  Also, 
sources  of  cueing  information  are  not  limited 
to  onboard  systems,  but  may  include  coopera¬ 
tive  air/ ground  elements  as  well. 

In  a  study  currently  in  progress,  cueing 
requirements  for  a  target  acquisition  system 
utilizing  a  primary  FUR  display  are  being 
investigated.  Of  particular  concern  are  the 
resolution  and  discrimination  capabilities  of 
the  cueing  sensor,  and  the  extent  to  which 
inaccurate  false  alarm  cueing  data  (from  what¬ 
ever  source)  will  be  apt  to  counteract  the 
benefits  of  cueing  on  the  primary  acquisition 
display. 

No  single  approach  to  aiding  the  opera¬ 
tor  in  the  target  search  task  is  likely  to  pro¬ 
vide  an  overall  solution  to  the  target  acquisition 
problem.  Rather,  as  in  the  case  of  weapon - 
target  matching,  aiding  techniques  will  have 
to  be  developed  and  applied  with  a  view  to 
the  different  mission,  environment,  and  system 
design  conditions  that  apply,  and  to  the  particu¬ 
lar  target/background  search  situations  that 
result  when  these  conditions  are  imposed. 
Analysis,  modeling,  laboratory  research, 
simulation  testing,  and  field  test  verification 
each  have  an  important  role  in  this  process. 
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A  DESCRIPTION  OF  THE  TARGET  ACQUISITION  WORKING  GROUP 

By 

Ronald  A.  Erickson 
Naval  Weapons  Center 
China  Lake,  California 


ABSTRACT 

This  paper  describes  a  newly  formed, 
tri-service  Target  Acquisition  Working 
Group  (TAWG)  which  has  been  formed  under 
The  Joint  Munitions  Effectiveness 
Manuals/Air-to-Surf ace  division  of  The 
Joint  Technical  Coordinating  Group  for 
Munitions  Effectiveness.  The  TAWGfs 
scope,  organization,  and  current 
program  are  described. 

ORGANIZATION 

This  paper  is  a  description  of  a 
recently-formed,  tri-service  Target 
Acquisition  Working  Group  (TAWG) .  The 
purpose  of  the  paper  is  to  make  the 
target  acquisition  community  aware 
of  The  TAWG's  existence,  and  to  im¬ 
prove  communication  and  coordination 
of  work  in  target  acquisition.  Being 
a  tri-service  group,  we  solicit  aid 
from  all  thirds.  The  TAWG  is  organized 
under  The  Joint  Munitions  Effectiveness 
manuals  for  Air-to-Surf ace  (JMEM/AS) . 
Perhaps  some  of  you  are  familiar  with  the 
munitions  effectiveness  manuals  that  are 
produced  by  The  JMEM.  The  Chairman  of 
JMEM/AS  is  Mr.  Thomas  Christie  from  Eglin 
Air  Force  Base,  and  the  Co-chairman  is 
Dr.  Margarite  Rogers  from  NWC,  China  Lake 

The  JMEM/AS  is  a  division  of  The 
Joint  Technical  Coordinating  Group  for 
Munitions  Effectiveness  (JTCG/ME)  which 
is  chaired  by  Dr.  Joseph  Sperrazza.  Dr. 
Sperrazza,  who  is  also  the  technical 
director  of  The  Army  Material  Systems 
Analysis  Agency,  reports  on  JTCG  activi¬ 
ties  directly  to  The  Joint  commanders. 

The  hierarchy  is  shown  partially  in 
Figure  1. 


JOINT  COMMANDERS 


JTCG/ME 

Dr.  J.  Sperrazza,  Chairman  (AMSAA) 
Mr.  R.  Hartmeyer,  Co-Chairman  (AFSC) 

I 

JMEM/AS 

Mr.  T.  Christie,  Chairman  (Eglin) 

Dr.  M.  M.  Rogers,  Co-Chairman  (NWC) 


TAWG 

Mr.  R.  Erickson,  Chairman  (NWC) 

Major  R.  Hilgendorf,  Co-Chairman (WPAFB) 


FIGURE  1.  TAWG  Line-of- 
Responsibility 

OBJECTIVE 

The  JTCG/ME  is  responsible  for 
producing  manuals  which  provide  all 
DOD  elements  a  uniform  basis  for: 

•Weapons  employment  planning  and 
use 

•Determination  of  munitions 
requirements 

•Evaluation  of  new  weapon  concepts 

•Stimulation  of  R  &  D  efforts. 

The  TAWG  objectives  within  this  frame¬ 
work  are  to  quantify  target  acquisition 
performance,  and  to  develop  a  means  of 
predicting  performance.  The  only 
current  bounds  on  The  TAWG  scope  are 
limitations  to 
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(1)  Air-to-ground  target  acquisition 

(TA) 

(2)  real-time  operations. 

We  are  assuming  that  there  will  be  at 
least  two  types  of  customers  for  our 
output:  The  operating  forces,  who 

would  like  to  know  the  level  of  perfor¬ 
mance  to  be  expected  for  any  given 
mission,  and  The  R  &  D  community  who  are 
interested  in  estimating  the  effectiveness 
of  existing,  prototype,  and  hypothesized 
systems. 

SCOPE 

Real-time,  air-to-ground  target 
acquisition  (TA)  can  involve  direct 
vision,  sensor /display  systems,  and 
optical  systems.  These,  in  turn,  can  be 
broken  down  into  subdivisions;  e.g.  sensor/ 
displays  include  television,  infra-red, 
and  radar  systems.  As  far  as  I  have  been 
able  to  determine,  The  TAWG  will  be  dealing 
with  all  of  these,  although  funding  and 
manpower  levels  are  limiting  us  primarily 
to  TA  via  direct  vision  at  the  moment. 

Since  The  TAWG  is  a  tri-service 
group,  we  will  be  dealing  with  a  variety 
of  aircraft:  OH-58  and  AH-56  helicopters, 
the  A-8  Harrier,  The  0V-10,  A-7's  to  F-4*s, 
F-14’s,  and  the  new  AX.  The  visibility 
from  these  aircraft,  mission  types,  and 
speed  and  altitude  regimes  must  be 
included  among  our  target  acquisition 
parameters . 

Any  mission  involving  target  acquisi¬ 
tion  can  be  broken  into  segments;  those 
segments  with  which  The  TAWG  is  primarily 
concerned  are  (1)  target  area  recognition, 
(2)  search  for  the  target,  and  (3)  detec¬ 
tion/recognition/identification  of  the 
target.  Although  factors  like  reconnais¬ 
sance,  pilot  training,  pilot  briefing, 
target  designation,  and  conversion  to 
attack  are  in  mission  success,  The  TAWG 
is  hoping  to  bite  off  only  the  3  items 
listed  above. 

METHODOLOGY 

Several  methodologies  have  been  used 
in  the  past  to  produce  estimates  of  target 
acquisition  performance:  mathematical 
modeling,  simulation  (e.g.  with  photos  or 
terrain  models),  field  testing,  and  use 
of  combat  data.  The  TAWG  intends  to  use 
all  appropriate  methods  and  has,  in  fact, 


partially  organized  around  methodology 
type. 

TARGET  ACQUISITION  DEFICIENCIES 

As  far  as  I  can  tell,  the  reasons 
for  The  TAWG's  existence  are  (1)  the 
paucity  of  usable  data  that  can  be 
applied  to  mission  planning  and  R  &  D 
efforts,  and  (2)  the  need  for  a  tri¬ 
service  group  to  answer  TA  questions 
which  arise.  Some  of  the  deficiencies 
which  we  have  identified  are  listed  in 
Table  1  below.  These  deficiencies  were 
used  to  define  some  of  the  candidate 
TAWG  projects  shown  in  Table  2.  Only 
some  of  these  are  being  pursued  at  this 
time. 

TABLE  1.  SOME  TARGET  ACQUISITION 
DEFICIENCIES 

MATH  MODELING  INADEQUATE 

MOST  MATH  MODELING  NOT  VALIDATED 

CONTEXT  (IMBEDDEDNESS,  CONSPICUOUSNESS) 

NOT  QUANTIFIED 

PERTINENCE  OF  LAB  STUDIES  TO  FIELD  PER¬ 
FORMANCE  NOT  KNOWN 
SEARCH  METHODS  NOT  KNOWN 

INDIVIDUAL  DIFFERENCES  CANNOT  BE  ACCOUNTED 
FOR  OR  EXPLAINED 

MOST  FIELD  TESTS  POORLY  REPORTED 
MOST  FIELD  TESTS  POORLY  STRUCTURED 
FIELD  TESTS  NONCOMPLEMENTARY  (DIFFICULT 
TO  SUMMARIZE  AND  COMBINE) 

IMAGE  QUALITY  DESCRIPTORS  NOT  STANDARDIZED 
NO  STANDARDIZED  METHOD  OF  SYSTEM  ADJUSTMENT 
AND  CALIBRATION 

TARGET/ BACKGROUND  SIGNATURES  NOT  MEANINGFULLY 
QUANTIFIED  OR  DEFINED 

TABLE  2.  CANDIDATE  TAWG  PROJECTS 

TERRAIN/GROUND  TARGET  CLASSIFICATION/ 
QUANTIFICATION 
•IMBEDDEDNESS 
•CONSPICUOUSNESS 
•CAMOUFLAGE 

DISPLAY  QUALITY/PERFORMANCE 

•STANDARDIZE  MEASURES  OF  QUALITY 
•RELATE  QUALITY  MEASURES  TO  OPERATIONAL 
PERFORMANCE 

MATH  MODEL  REVIEW  AND  SYNTHESIS 
SHIP  ACQUISITION  PERFORMANCE 
•USE  OF  TACTICS 
•USE  OF  SENSORS 

•RECOGNITION  CRITERIA  &  PERFORMANCE 
GEOGRAPHIC  ORIENTATION  IN  TARGET  AREA 
•WHERE,  WHEN  TO  SEARCH  FOR  TARGET 
•EXPECTATION  OF  TARGET  PRESENCE 
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USE  OF  FAC’s 

•FAST  FAC’s;  CAN  WE  USE  FAC’s  IN  SITUA¬ 
TIONS  OTHER  THAN  SEA? 

USE  OF  FLARES 

•NUMBER,  LUMINANCE,  PLACEMENT,  ALTITUDE 
•AIRCRAFT  TACTICS 

INITIAL  TAWG  TASKS 

Some  of  the  major  current  TAWG  pro¬ 
jects  are  shown  below  with  some  of  the 
work  components  indicated. 

1.  MATH  MODEL  REVIEW  AND  SYNTHESIS 

•DEVELOP  EVALUATION  CRITERIA 
•REVIEW  AND  CLASSIFY  MODELS 
•EVALUATE  FOR  INPUT  AVAILABILITY, 
VALIDITY,  ASSUMPTION  CONTENT, 
OUTPUT  FORMAT,  COMPUTABILITY 
•RECOMMEND  MODEL ( S )  FOR  TAWG  USE 
•RECOMMEND  MODEL  DEVELOPMENT /REFINE¬ 
MENT  (e.g.  SENSITIVITY  ANALYSES) 
•RECOMMEND  PROGRAM  FOR  PROVIDING  OR 
OBTAINING  REQUIRED  MODEL  INPUTS 

2.  USE  OF  FLARES 

•CONTINUE  FLARE  RESEARCH 
•REVIEW  ALL  FLARE  PROGRAMS 
•SUMMARIZE  FINDINGS  TO  DATE 
•PREPARE  FLARE  USE  MANUAL 

3.  FORWARD  AIR  CONTROLLER 

•PROBLEM  DEFINITIONS,  TASK  ANALYSIS, 
IDENTIFY  CRITICAL  TASKS 
•PRELIMINARY  ESTIMATION  OF  TA  PER¬ 
FORMANCE 

4.  TERRAIN/ TARGET  CLASSIFICATION 

•TERRAIN  TAXONOMY  (PHOTO  STUDY) 
•TERRAIN/TARGET  COMBINATIONS 
IMBEDDEDNESS 
CONSPICUOUSNESS 

•INDICES  OF  TARGET  ACQUISITION 
DIFFICULTY 

5.  FIELD  DATA  SUMMARY 

•DEVELOP  EVALUATION  CRITERIA 
•EVALUATE  REPORTS  OF  TA  FIELD  TESTS 

6.  SUMMARIZE  TERRAIN  MASKING  DATA 

•LITERATURE  SEARCH 
•DEVELOP  EVALUATION  CRITERIA 
•  EVALUATE  REPORTS 
•SUMMARIZE/SYNTHESIZE  DATA 
•IDENTIFY  DEFICIENCIES 

The  work  is  being  carried  on  under  the 
cognizance  of  the  subgroup  chairman 
shewn  in  Table  3.  We  are  currently 
working  under  the  assumption  that  the 
TAfcG  will  exist  for  at  least  2  or  3 
years,  chipping  away  at  assignments  as 


funding  and  staffing  permit.  One  of  the 
larger  areas  which  we  have  not  yet 
addressed  is  that  of  TA  aids.  This  whole 
area  of  sensors,  displays,  cuing  devices, 
etc.  is  ahead  of  us. 

TABLE  3.  TAWG  SUBGROUP  CHAIRMEN 

•TAWG  CHAIRMAN  - 

RONALD  ERICKSON,  NWC 

•MATHEMATICAL  MODELING  - 

DR.  H.  H.  BAILEY,  RAND  _ 

•ANALYTIC  STUDIES  - 

DARRYL  THORNTON,  EGLIN  AFB 

•LABORATORY  SIMULATION  - 

RONALD  BRUNS,  NMC 

•FIELD  TESTING  - 

PAUL  AMUNDSON,  NWC 

•FORWARD  AIR  CONTROLLER  - 

LCOL  ED  WAGGONER,  BROOKS  AFB 
•FLARES  - 

MAJOR  BOB  HILGENDORF,  WPAFB 
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THE  EFFECTS  OF  IMAGE  QUALITY 
ON  INFRARED  TARGET  RECOGNITION* 

Leon  G.  Williams  and  Carl  P.  Graf 
Honeywell  Research  Labs 
St.  Paul,  Minnesota  55113 


The  study  objective  was  to  determine 
the  relative  importance  of  four  image 
quality  variables,  lines/target,  modulation 
transfer,  signal-to-noise  ratio,  and  target 
size  on  target  recognition  in  a  static  dis¬ 
play.  The  study* s  conclusions  are  supported 
by  four  factors:  First,  a  realistic  target 
recognition  task  was  used  —  16  examples  of 
each  of  10  infrared  target  categories  were 
obtained  for  subsequent  processing.  Second, 
a  high  degree  of  control  over  the  stimuli 
was  obtained  by  digital  image  processing 
techniques.  Third,  a  large  volume  of 
experimental  data  were  obtained  —  18  sub¬ 
jects  produced  data  from  15,108  recogni¬ 
tion  trials.  Fourth,  linear  regression 
techniques  were  applied  to  the  data  to 
determine  the  relative  importance  of  the 
severaL  image  quality  variables. 

Method 

The  initial  infrared  pictures  of  mili¬ 
tary  targets  (tank,  truck,  jeep,  gun,  etc.) 
were  produced  at  the  Honeywell  Proving 
Grounds  in  Minnesota.  An  IR  sensor  in  the 
1.0-5. 5  micron  range  displayed  the  thermal 
images  on  a  CRT  for  a  range  of  temperature 
conditions  (night,  day,  target  hot,  target 
cold) .  Photographs  were  made  of  these 
images .  The  desired  levels  of  image  degra¬ 
dation  were  achieved  with  the  digital  image 
processing  facility. 

Each  target  transparency  (1x2  inches)  was 


*This  wDrk  has  been  sponsored  by  the  Air  Force 
Avionics  Laboratory  at  Wright-Patterson  AFB 
under  Contract  No.  F33615-72-C-1303. 


written  on  magnetic  tape  as  250  records, 
each  record  containing  500  bytes,  by  a 
drum  scanning  technique.  The  main  image 
degradation  was  achieved  by  digital  com¬ 
puter  processing.  First,  a  Gaussian 
point  spread  function  was  applied  to 
create  the  specified  level  of  modulation 
transfer.  Then,  Gaussian  noise  was  added 
to  each  sample  picture  element.  Finally, 
the  line  scan  structure  was  created.  The 
output  tape  was  made  into  a  0.5  x  1.0 
inch  transparency  by  a  drum  writer.  A 
total  of  1120  transparencies  were  made. 

The  experimental  set-up  consisted  of 
a  modified  35mm  slide  projector,  a 
quality  rear  projection  screen,  a  timer 
and  printer,  and  a  logic  controller  for 
the  apparatus.  In  a  trial  a  transparancy 
was  projected  at  one  of  four  magnifica¬ 
tions  to  achieve  mean  target  heights  rang¬ 
ing  from  10  to  80  arc  minutes  of  visual 
angle.  The  subject  had  five  seconds  to 
make  a  recognition  response.  His  response 
and  time  were  recorded. 

Design 

The  study  consisted  of  two  parts.  In 
the  first  part,  initial  evaluation,  the 
objective  was  to  determine  the  degrada¬ 
tion  levels  to  be  used  for  the  main 
experiment.  This  was  accomplished  by 
degrading  a  set  of  four  pictures  at  all 
levels  of  interest  and  inspecting  the 
output  transparencies .  The  main  out¬ 
come  of  this  effort  was  the  decision  to 
use  more  noise  and  poorer  MTF  for  those 
pictures  having  more  scan  lines. 

In  the  main  experiment  there  were  224 
degradation  conditions  defined  by  four 
levels  (with  an  exception)  of  each  of  the 
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four  main  image  quality  variables.  The 
nominal  ranges  of  each  variable  were  as 
follows:  1)  lines/target  (3  to  24);  2) 

MTF  (0.08  to  0.47  modulation  at  one-half 
the  cutoff  frequency) ;  3)  signal-to-noise 
(1:1  to  12:1);  4)  target  size  (10  to  80  arc 
minutes).  There  were  18  subjects,  6  with 
considerable  experience  with  FLIR  imagery. 

All  subjects  saw  the  total  set  of  1120 
transparencies,  280  during  a  three  hour 
training  session  and  the  remaining  840  during 
the  experimental  trials. 

Precise  measurements  of  size  and  density 
were  made  on  each  of  the  original  target 
transparencies .  There  were  two  reasons  for 
doing  this:  To  define  the  values  of  the 
degradation  and  to  determine  which  target 
characteristics  other  than  the  four  main 
image  quality  variables  were  related  to 
recognition.  The  following  measurements 
were  made  on  each  target:  Height,  width, 
minimum  dimension,  maximum  dimension,  area 
on  the  transparency,  perimeter,  mean  edge 
contrast,  mean  density,  background  density 
maximum  picture  density,  and  minimum  picture 
density.  The  target  measures  were  combined 
with  the  measures  of  image  quality  to  pro¬ 
vide  a  variety  of  combined  measures.  Two 
examples  are  target  width  in  terms  of  number 
of  scan  lines  and  signal-to-noise  ratio.. 

A  measure  of  resolving  power  was  made  for 
each  degradation  level  using  the  Air  Force 
Resolution  Chart.  Resolving  power  was  de¬ 
fined  by  the  minimum  size  3-bar  pattern  which 
could  be  visually  separated  at  that  degrada¬ 
tion  condition.  It  was  found  that  resolving 
power  depended  almost  entirely  on  the  number 
of  lines/target  and  target  size  (i.e. , 
magnification) . 

Results 

Data  from  15,108  trials  were  analyzed. 

The  overall  recognition  accuracy  was  58 
percent;  the  mean  response  time  was  2.75 
seconds.  The  correlation  of  response  time 
and  accuracy  was  -0.93.  Although  the  ex¬ 
perienced  subjects  recognized  about  as  many 


targets  as  the  students,  they  were  about  one 
second  slower.  Accuracy  was  much  greater 
for  some  targets,  such  as  the  tank  or  truck, 
than  others. 

The  main  result  of  the  regression 
analysis  was  that  lines/target  and  noise 
in  linear  combination  accounted  for  over 
60  percent  of  the  variance  in  both  time 
and  accuracy  for  the  224  degradation 
conditions.  Resolving  power,  however, 
was  a  better  predictor  of  performances 
than  lines/target.  Resolving  power  and 
noise  together  accounted  for  78  percent 
of  the  variance  for  the  224  degradation 
conditions . 

What  target  properties  make  it  easy 
or  hard  to  recognize?  Two  kinds  of 
variables  were  studied,  target  size,  and 
target-background  contrast.  If  recogni¬ 
tion  was  predicted  from  target  size  alone, 
then  size  was  best  measured  in  terms  of 
resolving  power.  Specifically,  target 
perimeter  /  expressed  in  terms  of  resolv¬ 
ing  power,  was  the  best  single  predictor 
of  performance.  This  was  similar  to  John 
Johnson's  widely  accepted  measure,  namely, 
target  height  measured  in  terms  of  resolv¬ 
ing  power.  Target  size,  expressed  in  terms 
of  visual  angle  was  a  surprisingly  poor 
predictor.  Target-background  contrast  was 
measured  better  by  mean  edge  contrast  than 
by  the  simpler  overall  contrast  measure. 

In  summary,  recognition  was  best  oredicted 
by  the  following  variables:  resolving  power 
noise,  target  perimeter,  and  edge  contrast. 

Re  comme  nd  a t i o n 

The  study  results  strongly  suggested 
that  future  effort  be  concerned  with  a 
more  precise  understanding  of  resolving 
power.  Specifically,  how  do  resolving 
power  and  recognition  depend  on  the 
primary  aspects  of  the  sensor-display 
line  scan  structure;  i.e.,  sensor  scan¬ 
ning  aperture — size,  shape,  and  spacing; 
and  display  writing  aperture — size,  shape 
and  spacing. 


THIS  IS  AN  ABSTRACT  OF  THE  PAPER  PRESENTED  AT  THE  SYMPOSIUM. 
FOR  FURTHER  INFORMATION,  THE  AUTHORS  MAY  BE  CONTACTED. 


UNCLASSIFIED 


196 


AIM  ANALYSIS  OF  RADAR  TARGET  ACQUISITION 
PERFORMANCE  EFFECTS  ON  SURVEILLANCE  AND 
DIRECTION  EFFECTIVENESS  THROUGH  SIMULATION 

by 

Donald  A.  Topmiller,  PhD 
and 

Wayne  L.  Martin 

Aerospace  Medical  Research  Laboratory 
Aerospace  Medical  Division 
Air  Force  Systems  Command 
Wright-Patterson  Air  Force  Base,  Ohio 


OFFICE  OF  NAVAL  RESEARCH 

TARGET  ACQUISITION  SYMPOSIUM 

'JAVAL  TRAINING  CENTER,  ORLANDO,  FLORIDA  /  14,15,16  NOVEMBER  1972 


197 


r 


198 


AN  ANALYSIS  OF  RADAR  TARGET  ACQUISITION  PERFORMANCE  EFFECTS  ON 
SURVEILLANCE  AND  DIRECTION  EFFECTIVENESS  THROUGH  SIMULATION 


Donald  A.  Topmiller,  Ph.D. 
and 

Wayne  L.  Martin 

Aerospace  Medical  Research  Laboratory 
Aerospace  Medical  Division 
Air  Force  Systems  Command 
Wright-Patterson  Air  Force  Base,  Ohio 


ABSTRACT 

This  paper  summarizes  studies  done  in 
command/control  simulations  of  surveillance 
functions  where  the  operator  is  required 
to  detect  threats  through  an  automated 
digital  radar  system  and  of  weapons  direc¬ 
tion  functions.  The  effectiveness  of  the 
operators  performance  depends  largely  on 
the  ability  of  the  operator  to  acquire  a 
nonimaging  and  digitally  displayed  target 
and  to  identify  what  he  perceives  as  being 
differ snt  from  nontarget  returns  using  the 
time  course  history  of  the  returns.  He 
must  tien  assign  this  information  to  a 
computer  programmed  to  automatically  track 
the  progress  of  the  target  across  the 
survei!Lled  area.  The  operator  may  perform 
this  coordinate  position  identification 
and  subsequent  track  initiation  with  a 
light  pen  or  through  cursor  positioning 
with  a  track  ball,  joystick,  or  other 
means.  The  weapons  direction  function 
must  bo  performed  as  effectively  as  the 
surveiiJLance  function  in  order  to  achieve 
maximum  defense  capability. 

INTRODUCTION 

Recognizing  that  the  primary  purpose 
of  thin  symposium  is  to  appraise  the 
state-of-the-art  in  acquiring  targets 
which  have  "image11  characteristics,  this 
paper  is  intended  to  present  a  wholly 
different  problem  in  target  acquisition, 
viz. ,  that  of  acquiring  targets  through 
nonimaging  sensors  (i.e.,  digitalized 


radar).  The  problem  of  identifying 
target  characteristics,  as  well  as 
problems  associated  with  counteracting 
techniques  designed  to  purposely  distort 
these  characteristics  (e.g.,  camouflage 
for  imaging  targets  and  ECM  for  elec¬ 
tronically  sensed  targets),  is  more 
universal  than  that  associated  with 
digital  radar  systems.  However,  the 
main  intent  of  this  paper  is  to  explore 
target  acquisition  and  directing  capa¬ 
bilities  in  "automated"  digital  radar 
systems. 

To  discuss  this  area  and  its 
associated  experimental  findings  fran 
a  two-year  program  of  computer-based 
simulation  first  requires  seme  conceptual 
distinctions, 

Behaviorally,  including  both  visual 
mechanisms  and  the  more  general  realm  of 
perception,  imaging  capability  has  a 
greater  experiential  and  experimental  data 
base  to  draw  upon  than  nonimaging  informa¬ 
tion  structures  and  displays.  After  all, 
the  giant  strides  in  electronic  and 
computer  technologies  have  only  occurred 
within  the  last  30-40  years  to  make  this 
possible.  Biologically,  man  has  evolved 
into  an  astute  visual  perceptual  sensor 
of  his  environment;  viz.,  all  information 
transformation  is  biologically  and  psy¬ 
chologically  determined.  Alternately, 
technology  growth  has  enabled  man  to  be 
a  secondary  detector  and  perceiver  of 
his  environment,  wherein  the  information 
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has  been  physically  sensed  (RF  (radio 
frequency)  energy,  IR  (infrared)  energy, 
etc. J  ,  and,  of  necessity,  filtered,  trans¬ 
formed  and  displayed.  Albeit  these  filters 
and  transformation  schemes  were  developed 
by  man  himself,  frequently  they  are  not 
designed  to  maximally  exploit  the  operator’s 
information  receiving-processing  capabili¬ 
ties.  With  analog  radar  and  infrared 
sensors,  there  are  built-in  perceptual  cues 
that  are  not  necessarily  remote  from  the 
"true11  image.  However,  with  digital  radar, 
the  sheer  process  of  digitalization  of  the 
sensed  data,  normally  driven  by  temporal 
considerations  such  as  antenna  scan  rate, 
provides  a  greater  challenge  to  the  engi¬ 
neering  psychologist  responsible  for  devel¬ 
oping  the  research  necessary  to  define 
optimal  information,  transformation, 
structure  and  display  techniques. 

Turning  now,  from  the  information 
sensing  and  transformation  stages  of  the 
target  acquisition  process,  to  the  command- 
control  implications,  higher  order  psycho¬ 
logical  functions  are  demanded  from  the 
operator:  once  the  target  has  been  acquired, 
what  resources  should  be  committed  to  most 
effectively  eliminate  the  threat.  A  very 
limited  facet  of  this  problem  has  been 
addressed  through  recent  Airborne  Warning 
and  Control  (AWACS)  weapons  direction  simu¬ 
lations  in  our  laboratory.  A  more  extensive 
aspect  of  the  "inferential  process"  involved 
in  tactical  decision-making  was  treated  in 
a  six-year  program  at  Ohio  State  University 
Research  Foundation  and  has  been  extensively 
documented  elsewhere  (Howell,  1967;  Howell 
and  Gettys,  1968). 

SOME  CONCEPTS  IN  DIGITALIZED 
SURVEILLANCE  SYSTEMS 

Radar  History 

One  of  the  often  professed  advantages 
of  the  "automated"  surveillance  radar  system 
is  the  ability  to  sense,  store  and  play  back 
radar  returns  from  several  sensing  antennae. 
The  "storing"  function  allows  the  systems 
to  be  designed  such  that  the  actual  history 
of  the  scans  can  be  played  back  at  any  rate 
of  which  the  computer  is  capable.  Recent 
research  (Scanlon  et  al,  1971)  has  shown  the 
advantage  of  time-compressed  radar  returns 
for  display  purposes  since  the  targeting 
returns  can  be  more  readily  distinguished 
from  nontarget  information,  i.e.,  clutter. 


Tracks-Radar  Trails 

The  actual  "playback  function"  in 
a  digital  radar  system  creates  a  series 
of  coherent  blips  on  the  scope  which  can 
be  defined  as  a  radar  trail.  Returns  that 
are  "canned"  to  represent  false  reports 
may  have  any  degree  of  temporal/spatial 
similarity  to  data  that  have  been  "canned" 
to  represent  the  timed  history  of  actual 
aircraft.  The  "canning"  can  also  define 
the  start  and  stop  positions  of  the  trail, 
velocity  of  the  trail,  type  of  return 
(i.e.,  Mark  X,  search  or  false  reports) 
and  time  of  entry  for  each  trail. 

Clutter 

Since  any  RF  energy  reflecting 
surface  (such  as  automobiles,  bridges, 

TV  towers,  etc.)  can  produce  a  return  in 
a  digital  system,  without  exotic  filter¬ 
ing  techniques  the  system  cannot  discrim¬ 
inate  noisy  returns  or  clutter  from  true 
target  returns.  On  more  traditional 
analog  radar  systems,  the  return  patterns 
can  give  perceptual  cues  as  to  distinguish¬ 
ing  characteristics  of  clutter  versus 
targets  which  enable  the  surveillance 
operator  to  exploit  this  information.  In 
the  simulations  of  digital  radar  systems, 
the  clutter  is  typically  generated  by  same 
statistical  distribution;  in  several  of 
our  studies,  we  mixed  uniform  and  expo¬ 
nential  distributions.  This  method  pro¬ 
vided  a  realistic  distribution  of  clutter 
that  was  unevenly  distributed  over  the 
surveillance  area  and  contained  "clumping". 

Blip/Scan  Ratios 

Because  of  other  noisy  sources, 
including  atmospheric  degradation,  inten¬ 
tional  countermeasures,  etc.,  not  every 
energy  pulse  from  a  scan  of  the  antenna 
will  have  an  associated  return.  The  ratio 
of  the  emitted  scans  to  the  sensed  returns 
is  referred  to  as  the  blip/scan  ratio. 

This  parameter  can  be  varied  in  our  simu¬ 
lations  (typically  from  .5  to  .9)  and 
represents  a  true  capability  to  systemat¬ 
ically  degrade  the  quality  of  the  informa¬ 
tion  available  to  the  surveillance 
operator. 
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Track  Failure 

The  tracking  algorithms,  in  general, 
are  subject  to  failure  due  to  many 
influences  such  as  maneuvering  aircraft, 
crossing  aircraft,  blip/scan,  clutter  and 
radar  position  errors.  No  matter  how 
effective  the  algorithm  is,  certain  radar 
data  systems  will  cause  a  tracking  failure. 
The  failure  occurs  when  the  predicted 
position  is  caused,  by  one  or  more  of  the 
above  influences,  to  be  at  the  wrong 
coordinates  with  respect  to  the  radar  data 
from  the  aircraft  being  tracked.  When  this 
happens,  the  tolerance  can  be  opened  up. 

THE  OPERATOR  AS  A  THREAT  DETECTOR 
Jingle  Operator  Simulations 

Early  concerns  in  advanced,  automated 
digital  radar  systems  dealt  with  the 
number  of  targets  that  a  single  operator 
could  be  expected  to  detect  and  assign  to 
the  computer  for  automatic  tracking.  The 
two  primary  threat/radar  environment 
parameters  which  were  thought  to  contribute 
most  significantly  to  the  operators  abil¬ 
ity  to  perform  the  track  initiation  and 
maintenance  functions  were  clutter  and  the 
rate  at  which  the  penetrators  entered  the 
display  area. 

Robert  G.  Mills,  from  our  group,  was 
principally  responsible  for  developing  the 
single-operator  simulation  studies  which 
are  more  thoroughly  discussed  in  two  AMRL 
technical  reports  (AMRL-TR-7 0-103  and  AMRL- 
TR-71-76). 

The  simulation  facility  consists  of  an 
IBM  360-40H  computer  system  which  drives 
four  IBM  2250  cathode  ray  tube  (CRT)  graph¬ 
ics  terminals  which  are  used  as  the  primary 
control/display  units.  The  terminal  has  a 
CRT  display  surface  of  144  square  inches 
(12  in.  x  12  in.).  The  CRT  is  coated  with 
P-7  phosphor  which  persists  on  the  average 
of  400  ms  after  excitation.  The  terminal’s 
light  pen  and  programmed  function  keyboard 
(PFK)  consist  of  32  response  keys  for  opera¬ 
tor  communication  with  the  computer. 

Diagram  A  is  a  representation  of  a  time 
exposure  photograph  taken  over  two  radar 
updates  (six  scans  are  shown)  during  an 
early  20-second  period  of  a  mission.  A 


number  of  targets  are  shown,  several  of 
which  have  been  initiated. 

The  subject’s  task  was  to  monitor  his 
surveillance  area  and  to  perform  track 
initiation  and  maintenance  functions.  The 
initiation  function  required  the  subject 
to  light  pen  any  one  of  five  historical 
returns  suspected  of  representing  a 
target.  Following  this  action,  the  sub¬ 
ject  activated  a  response  key  labeled  NT 
(new  track).  The  maintenance  function 
was  performed  in  the  same  manner,  except 
that  the  subject  pressed  a  response  key 
labeled  OT  (old  track)  instead  of  NT 
when  he  detected  the  disassociation  of  a 
track  with  a  signature. 

As  mentioned  previously,  two  independ¬ 
ent  variables  (clutter  and  target  intro¬ 
duction  rate)  were  manipulated  in  the 
simulation.  The  levels  of  clutter  were 
.016  pieces/square  nautical  mile  (Cl,  90 
total  pieces/scan),  .032  pieces/square 
nautical  mile  (C2,  180  total  pieces/scan), 
and  .064  pieces/square  nautical  mile  (C3, 
360  total  pieces/scan).  The  levels  of 
target  introduction  rate  were  1,  2,  3>  and 
4  targets/minute.  The  mission  took 
approximately  50  minutes  of  real  time,  and 
subjects  completed  three  missions/week 
over  a  two-month  period. 

One  of  the  findings  of  this  simulation 
is  the  positive  relationship  between  the 
two  surveillance  functions  of  initiation 
and  maintenance.  This  relationship  is 
not  necessarily  surprising  (see  Figure  l) 
and  perhaps  could  have  been  predicted  from 
basic  research  on  attention  (vigilance) 
behavior.  In  Figure  1  the  parabolic 
function  demonstrates  that  the  more  effort 
the  subject  devoted  to  maintaining  targets 
and  keeping  his  display  cleaned  up,  the 
higher  the  probability  of  initiating  new 
targets  as  they  entered  the  surveillance 
area. 

Figure  2  shows  the  effects  of  the  two 
independent  variables  of  clutter  and  tar¬ 
get  introduction  rate.  The  surprising 
result  is  that  clutter  appears  to  have 
very  little  effect  on  probability  of 
correct  initiation  and  was  not  a  signifi¬ 
cant  main  effect  in  an  analysis  of  vari¬ 
ance,  whereas  target  introduction  rate 
was  highly  significant  and  accounted  for 
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Diagram  A.  Representation  of  a  Time  Exposure  Photograph  of  the  CRT  Display 
Showing  One  Display  Update  During  a  20-Second 
Mission  Simulation  Period 


the  majority  of  variance.  Evidently  the 
visual-perceptual  discriminations  of  the 
operator  are  better  than  commonly  thought 
by  the  surveillance  community. 

Mills  summarizes  his  conclusions  and 
recommendations  as  follows: 

•  Operator  performance  is  degraded 
particularly  under  target  introduction 
rates  of  3  and  4  per  minute.  Until  other 


alternatives  are  investigated  two  operators 
should  perform  the  surveillance  tasks  when 
introduction  rates  reach  these  levels. 

•  Operator  performance  is  relatively 
unaffected  by  clutter  density  under  the 
time  compression  display  format,  however, 
further  investigations  should  be  performed 
to  determine  whether  or  not  this  conclusion 
holds  under  less  idealized  radar  display 
conditions . 
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Probability  of  Correct  Track  Initiation 


Nuabtr  of  Track*  Maintained 

Figure  1.  Probability  of  Correct  Track 
Initiation  as  a  Function  of  Number 
of  Tracks  Maintained  for  Two  Mission 
Conditions  (Introduction  Rate  =  4, 
Clutter  Density  =  0.16  and  0.32). 
Mission  Time  Covered  =  50  Minutes 
Number  of  Subjects  ■  6. 

key 

Target  Velocity 

-  500  knots 

- - —  900  knots 

Target  Introduction  Rate 


Figure  2.  Summary  Data  for  First  Two 
Single-Operator  Simulations  of 
Command/Control  Surveillance 

Functions.  Data  are  Averages  Across 
Operators . 

Multi-Operator  Simulations 

This  program  provided  a  surveillance 
system  simulation  in  which  up  to  four 
operators  could  perform  the  tasks  of  track 
initiation  and  maintenance  under  various 
digital  display  parameters. 

Some  of  the  parameters  were  fixed  but 
could  be  made  to  assume  any  value  appro¬ 
priate  to  the  research  problem  at  hand. 


These  fixed  parameters  were  established  at 
McanM  time  and  could  be  manipulated  in  any 
fashion  to  provide  the  following  variable 
characteristics:  start  and  stop  position 

of  trail,  velocity  of  trail,  type  of  return 
(i.e.,  Mark  X,  search,  or  false  reports), 
and  time  of  entry  for  each  trail. 

In  combination,  these  parameters  could 
be  manipulated  as  independent  variables  in 
an  experiment  to  provide  variable  introduc¬ 
tion  rate,  variable  enemy  attack  strategies, 
any  mix  of  trail  velocities,  a  broad  range 
of  trail  lengths,  false  report  density,  any 
blip/scan  ratio  and  radar  return  errors 
could  be  manipulated  mathematically/statis- 
tically. 

Returns  that  are  11 canned"  to  represent 
false  reports  may  have  any  degree  of  tempo¬ 
ral/spatial  similarity  to  data  that  have 
been  "canned"  to  represent  the  timed  history 
of  actual  aircraft. 

Once  the  "canned"  portion  of  the 
program  has  been  constructed,  a  large 
portion  of  what  the  operator  saw  (and 
thus  a  good  deal  of  his  work  load)  was 
determined. 

The  values  of  other  relevant  param¬ 
eters  could  be  designated  at  the  time  a 
particular  experimental  condition  is  run. 
Given  the  identical  "canned"  program 
(or  set  of  programs),  the  additional 
independent  variables  which  can  be  manip¬ 
ulated  are  as  follows: 

•  probability  of  track  failure  - 
Mark  X  data 

•  probability  of  track  failure  - 
Search  data 

•  heading  tolerance  permitted  on 
initiate  &  reinitiate  actions  -  Mark  X 

•  heading  tolerance  permitted  on 
initiate  &  reinitiate  actions  -  Search 

•  drift  threshold  -  used  to  hold 
off  drift-in  logic  until  operator  has  a 
chance  to  act 

•  trouble  track  indicator  -  number 
of  scans  a  track  may  drift  before  it  is 
considered  a  trouble  track 

•  correlation  specification  for 
track-to-air craft  correlation  -  this 
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defines  the  size  of  the  area  around  the 
tail  of  a  drifting  track  within  which 
search  returns  will  become  correlated 
with  the  track 

•  entire  geographic  background 
display  including  boundary  lines,  stra¬ 
tegic  location  designators,  and  latitude/ 
longitude  markers 

Several  display  options  were  pro¬ 
vided  through  program  functions  keys. 

The  operator  (subject)  may  omit  or  include 
any  of  the  following  categories  of  data: 

•  uncorrelated  history  data  for 
search  radar 

•  correlated  history  data  for 
search  radar 

•  uncorrelated  present  position  data 
for  search  radar 

•  correlated  present  position  data 
for  search  radar 

•  uncorrelated  history  data  for  Mark 

X  radar 


Uncorr,  Search 


Hist.) 


Pres. 


Corr.  Search 


Figure  3.  Programmed  Function  Key  Layout 


•  correlated  history  data  for  Mark  X 

radar 

•  uncorrelated  present  position  data 
for  Mark  X  radar 

•  correlated  present  position  data 
for  Mark  X  radar 

•  trouble  tracks 

•  good  tracks 

•  boundary  lines 

•  strategic  locations 

•  latitude/longitude  markers 
Figure  3  shows  the  layout  of  PFK’s, 

The  operator  also  provided  a  heading 
prediction  and  expanded  the  display  via  the 
PFK’s  as  indicated.  An  initiate  action, 
for  example,  required  depressing  the  ini¬ 
tiate  key,  light  penning  the  most  recent  of 
a  series  of  returns  and  depressing  the 
appropriate  heading  prediction  key. 


All  operators  actions,  whether  they 
were  good  or  bad  (and  if  bad,  the  cause), 
and  every  event  occurring  throughout  a 
simulated  set  of  radar  data  were  recorded 
on  a  scan-by-scan  basis.  The  status  of 
each  track  (i.e.,  drift  or  follow  mode) 
was  also  recorded.  At  the  conclusion  of 
an  experimental  session,  the  frequency  of 
each  event  category  was  compiled.  The 
definitions  of  these  categories  are  shown 
in  Table  I. 

Performance  measures  were  also  evalu¬ 
ated  automatically  at  the  end  of  each 
experimental  session.  These  measures, 
together  with  their  derivation,  are  pro¬ 
vided  in  Table  II.  Measures  1-6  relate 
to  operator  actions  and  represent  several 
ways  of  describing  what  the  operator  was 
doing  and  how  well  he  was  performing. 

Measure  7  was  taken  as  the  overall 
surveillance  system  performance.  "Good 
Tracking"  was  the  case  when  a  track  was 
followed  properly.  When  a  track  was  in 
drift  status,  "Bad  Tracking"  was  said  to 
be  the  case. 
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TAHLE  I 


TABLE  H 


DEFINITION  OF  EOTT  CATEGORIES 


PERFCRMANCI  MEASURES 


Sc&no 

EN 

SC 

IB 

IB  -  AI 
RG 

RB  -  WT 
IB  -  AL 
RB  -  AL 
IB  -  BH 
RB-BH 
IB  -  AF 
RB  -  AF 
RB  -  TD 
DG 

DB  -  SA 
DB  -  NT 
E)G 

EE  -  NO 
EB  -  NT 
EB  -  MT 


Scane  for  thie  run 
Aircraft  entered 
Scale  Change 
Initiate  Good 

Initiate  Bad  -  Already  Initiated 
Reinitiate  Good 

Reinitiate  Bad  -  Wrong  Track  associated  with  trail 
Initiate  Bad  -  Aircraft  Landed 
Reinitiate  Bad  -  Aircraft  Landed 
initiate  Bad  -  Bad  Heading  prediction 
Reinitiate  Bad  -  Bad  Heading  prediction 
Initiate  Bad  -  Already  being  Followed 
Reinitiate  Bad  -  Already  being  Followed 
Reinitiate  Bad  -  Track  already  Dropped 
Drop  Good 

Drop  Bad  -  Track  Still  Aseociated  with  a  trail 
Drop  Bad  -  No  Track  preeent 
Exchange  Good 

Exchange  Bad  -  Not  the  Original  aircraft 
Exchange  Bad  -  One  aircraft  has  No  Tract 
Exchange  Bad  -  One  aircraft  was  Mark  X  (Ten) 


DI 

DO  -  ST 


DO  -  AL 
TT 


3L 

IB  -  CL 
RB  -  CL 
DG  -  CL 
JSB  -  CL 

::b  -  af 


«  Drift  In 

■=  Drift  Out  -  Statietics 
=*  Drift  Out  -  Aircraft  Landed 

■  Trouble  Track  Status 

*  System  Limits 

=  Initiate  Bad  -  Clutter 

■  Reinitiate  Bad  -  Clutter 

*  Drop  Good  -  Clutter 

*  Exchange  Bad  -  Clutter 

Ignored  *  Thoee  Initiate  Bad  -  Already  being  Followed 
actions  that  occur  within  2  ecane  of  an  IG 


RB  -  AF  Ignored 


Those  Reinitiate  Bad  -  Already  being  Followed 
actione  that  occur  within  2  scans  of  an  RG 


The  purpose  of  this  study  was  to 
evaluate  operator  and  surveillance  system 
performance  at  various  trail  lengths, 
across  blip/scan  ratios  for  1,  2,  and  3 
man  teams.  The  effort  was  performed  to 
settle  the  question  concerning  how  many 
history  returns  (history  +  present  =  trail 
length)  must  be  stored  and  displayed  to 
the  operator. 


1.  Mean  Firet  Good  Initiate  Time  - 

EN 

c 

IG  (SCAM  #  IG  -  SCAN  0  EN)  %  10Q 
Total  0  IG 

2.  Percent  of  Initiate  Actione  That  Were  Correct  - 

Total  #  IG 

-  x  100 

Total  0  10  +  IB-41  +  IB-AL  +  IB-BH  +  IB-AF  +  IB-CL 

3.  Percent  of  Aircraft  Correctly  Initiated  - 

T°^  t  10  X  loo 
Total  #  EH 


U.  Percent  of  Reinitiate  Actione  That  Were  Correct  - 


_ Total  0  RG _ 

Total  #  RG  +  RB-WT  +  RB-AL  +  RB-BH  +  RB-AF  +  RB-TD  +  RB-CL 


X  100 


5.  Percent  of  Good  Maintenance  Actione  Taten  - 


Total  0  RG  +  DG  +  IK 


(Total  #  RG  +  DG  +  B3  +  RB-WT  +  RB-AL  +  RB-BH  +  RB-AF  +  RB-TD  + 
DB-NT  +  DB-SA  ♦  EB-NO  +  EB-NT  +  EB-MT  +  RB-CL  +  EB-CL) 


x  100 


6.  Groea. Merit  -  [[Good  -  Bad)  /  (Good  +  Bad)]  x  100  r 

Where:  Good  -IG  +  RG  +  DG  +  BG  +  DG-CL 

Bad  -  IB-AI  +  RB-WT  +  IB-AL  +  RB-AL  +  IB-BH  + 

RB-BH  +  IB-AF  +  RB-AF  ♦  RB-TD  +  DB-NT  + 

DB-SA  +  EB-NO  ♦  EB-NT  +  EB-NT  +  IB-CL  + 

RB-CL  +  EB-CL 


7.  Overall  Percent  of  Scane  With  Good  Tracking  » 

Total  0  Scane  With  Good  Tracking  ^  10Q 

(Total  0  Scane  With  Good  Tracking  ♦ 

Total  0  Scane  With  Bad  Tracking) 


Previous  studies  using  a  BUIC  III 
type  simulation  (Pearson,  1972)  showed 
that  a  trail  length  of  7  was  adequate  for 
tracking  through  noise.  Blip/scan,  how¬ 
ever,  was  held  at  unity  in  these  studies. 

The  mission  parameters  were: 

•  Mission  duration  of  28  minutes 

•  Aircraft  entered  the  200  x  200 
nautical  mile  surveillance  area  at  the 
average  rate  of  2/minute  (o=  .75/minute) 
with  a  mean  velocity  of  500  nautical  mph 
(°  =  150  nautical  mph) 

•  Radar  return  error  was  distribu¬ 
ted  as  a  Rayleigh  with  o  =  1  nautical  mile 

•  Probability  of  track  failure  = 

.13 

•  False  report  rate  (or  clutter) 
was  2/scan  across  each  of  the  trail 
lengths,  resulting  in  10,  14,  and  18  total 
pieces  distributed  as  a  uniform  bivariate 
in  X  and  Y  display  coordinates 
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The  experimental  design  was  as  follows: 

•  The  design  employed  a  3^3x3 
factorial  with  4  replications. 

•  Factor  levels: 

•  Trail  length  was  manipulated 
at  values  of  5,  7  and  9. 

•  Blip/scan  was  varied  at  levels 
of  .5,  .7  and  .9. 

•  Teams  of  1,  2  and  3  operators 
performed  track  initiation  and  maintenance 
tasks. 

•  Added  experimental  control  was 
provided  by  using  the  same  trained  opera¬ 
tors  who  participated  in  earlier  studies. 

An  original  4-man  team  was  broken  down 
into  1-,  2-  and  3-man  combinations  such 
that  counterbalancing  was  maintained  in  so 
far  as  possible  with  respect  to  an  individ¬ 
ual  operator's  contribution  to  performance 
by  each  team  size.  To  further  enhance 
control,  all  the  data  required  for  four 
replications  across  group  size  within  any 
particular  trail  length  X  blip/scan  combi¬ 
nation  was  collected  on  a  single  day.  The 
only  restriction  was  that  no  operator  ran 
more  than  two  28-minute  sessions  in  a  row 
and  each  had  at  least  a  28-minute  break 
between  all  other  sessions. 

Analyses  of  variance  were  performed  on 
each  of  the  seven  performance  measures. 
Summaries  of  these  analyses  are  provided 
for  only  the  Mean  First  Good  Initiate  Times 
and  Overall  Percent  Good  Tracking  and  are 
shown  in  Tables  III  and  IV.  Likewise, 
only  the  figures  plotting  means  for  First 
Good  Initiate  Time  and  Overall  Percent  of 
Scans  with  Good  Tracking  are  presented. 
Figures  4  through  6  can  be  overlayed  and 
the  lack  of  an  effect  of  trail  length  on 
Mean  First  Good  Initiate  Time  easily 
observed.  Likewise,  Figures  7  through  9 
portray  essentially  the  same  nonsignificant 
effect  of  trail  length.  Blip/scan  had  a 
statistically  significant  effect  (at  least 
at  the  .05  level)  for  every  performance 
measure,  while  trail  length  was  not  signif¬ 
icant  for  any  of  the  measures.  Group  size 
had  a  significant  effect  on  all  measures 
except  Percent  of  Initiate  Actions  That 
Were  Correct.  The  only  interaction  found 
to  be  significant  was  on  the  Overall  Percent 
of  Scans  With  Good  Tracking  measure  for 
trail  length  versus  blip/scan.  Increasing 


the  trail  length  does  enhance  overall  sur¬ 
veillance  system  performance  at  lower  blip/ 
scan  levels. 

TABLE  III 


ANALYSIS  OF  VARIANCE  FOR  MEAN  FIRST  GOOD  INITIATE  TIME 


Source  of  Variance 

df 

MS 

F 

A  (Group  Size) 

2 

2595. 14 

24.751* 

B  (Trail  Length) 

2 

60.74 

.579 

C  (Blip/Scan) 

2 

8960.48 

85.459* 

A  x  B 

4 

35.25 

.336 

B  x  C 

4 

208.59 

1.989 

A  x  C 

4 

56.61 

.539 

A  x  B  x  C 

8 

20.51 

.196 

Error:  Within  Treatments 

81 

104.85 

Total 

107 

*  p<  .001 


TABLE  IV 


ANALYSIS  OF  VARIANCE  FOR  OVERALL  PERCENT  OF  SCANS  WITH  GOOD  TRACKING 


Source  of  Variance 

df 

F 

A  (Group  Size) 

2 

201.64 

19.714* 

B  (Trail  Length) 

2 

23.92 

2.339 

C  (Blip/Scan) 

2 

504.71 

49.347* 

A  x  B 

4 

5.28 

.516 

B  x  C 

4 

40.00 

3.912* 

A  x  C 

4 

18.75 

1.833 

A  x  B  x  C 

8 

4.70 

.459 

Error:  Within  Treatments 

81 

10.23 

Total 

107 

*  p  <  .01 


The  seemingly  discrepant  observation 
for  blip/scan  =  .5,  trail  length  =  7  in 
Figure  8  is  probably  due  to  the  fact  that 
this  particular  combination  was  the  first 
cell  in  which  performance  data  were 
collected  in  this  experiment.  Subjects  had 
undergone  considerable  training  at  the 
identical  mission  parameters  but  appar¬ 
ently  still  had  not  reached  asymptotic 
performance  when  this  study  was  initiated. 
Analysis  of  subsequent  data  indicated 
that  little,  if  any,  improvement  in  per¬ 
formance  occurred  as  a  function  only  of 
practice. 
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Mean  First  Good  lnit.  Time  Mean  First  Good  Init,  Time  Mean  First  Good  Init.  Time 


GROUP  SIZE 


EFFECTS 


Group  Size 
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Earlier  in  this  report  it  was 
mentioned  that  shifts  in  operator  strat¬ 
egy  may  be  reflected  in  seemingly 
aberrant  variability  in  the  performance 
measures.  Changing  the  composition  of 
the  team  by  varying  the  number  of  opera¬ 
tors  should  also  provoke  a  change  in 
strategy.  The  quality  of  the  information 
environment  should  also  affect  the  way 
in  which  the  task  is  approached.  In 
Figure  10  the  resulting  shifts  in  trade¬ 
off  between  Percent  Reinitiation  Actions 
That  Were  Correct  versus  Percent  Aircraft 
Correctly  Initiated  are  portrayed.  Note 
that  when  the  information  environment  is 
poor  (i.e.,  blip/scan  =  .5)  the  single 
operator  seems  to  simply  do  the  best  he 
can  and  performance  on  these  two  measures 
is,  accordingly,  positively  correlated. 
The  tendency  to  trade  off  these  two 
activities  increases  as  the  information 
environment  improves,  as  evidenced  by  the 
negative  correlations  at  the  higher  blip/ 
scan  values. 


Group  Size 


Figure  10.  Percent  Reinitiation  Actions 
Correct  versus  Percent  Aircraft 
Correctly  Initiated. 


Recommendations  and  conclusions  were: 

•  Based  on  these  preliminary  data, 
two  major  engineering  design  recommenda¬ 
tions  can  be  made  for  digital  radar  sur¬ 
veillance  systems.  Both  operator  and 
surveillance  system  performance  are 
predominantly  driven  by  blip/scan  ratio 
and  not  appreciably  affected  by  trail 
length.  Engineering  efforts  to  increase 
blip/scan  ratio  should  have  significant 
payoffs  in  enhancement  of  surveillance 
system  performance  and  ultimately  overall 
operational  effectiveness. 


•  The  second  recommendation  concerns 
the  assignment  of  surveillance  areas  to 
multiple  surveillance  technicians.  On  a 
cursory  analysis,  one  might  be  led  to 
conclude  that  a  2-man  team  might  best 
split  their  functions  such  that  one  would 
be  responsible  for  initiate  actions  and 
the  other  for  reinitiate  actions.  This 
was  the  procedure  adopted  by  the  subjects 
in  this  study.  Three  operators  might 
similarly  partition  their  functions  commen¬ 
surate  with  the  radar  data  environment  of 
the  moment.  The  performance  data  did 
support  the  hypothesis  that  multiple  opera¬ 
tors  do  generate  improved  surveillance 
system  performance,  but  not  to  the  extent 
one  might  anticipate.  One  reason  for  this 
may  be  that  operators  who  monitor  and  per¬ 
form  only  part  of  the  sensing  task  required 
for  initiation  and  maintenance  are  less 
likely  to  be  aware  of  the  total  past  and 
present  activity  of  individual  radar  trails- 
and  have  not  been  forced  to  attend  the  total 
situation.  See  Figure  11,  comparing  1-, 

2-  and  3-man  groups  using  a  human  perform¬ 
ance  measure  of  mean  first  good  initiate 
time  with  a  systems  effectiveness  measure 
of  overall  good  tracking  for  different 
levels  of  probability  of  track  failure . 

•  An  alternative  to  the  functional 
allocation  of  duties  is,  of  course,  to 
assign  each  operator  his  own  (smaller) 
surveillance  area  and  demand  that  he  have 
total  responsibility  for  all  activities 
in  that  area. 


of  Track  Failure  on  MFGIT  and  GT 

THE  OPERATOR  AS  A  RESOURCE  MANAGER 

The  ultimate  goal,  of  course,  of  any 
command-control  system  is  to  commit 
deterrent  or  destructive  resources  which 
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most  effectively  minimize  a  developing 
threat.  Assuming  that  the  surveillance 
function  achieves  reasonable  levels  of 
target  detection,  acquisition,  and  track¬ 
ing  of  the  threat,  the  weapons  direction 
function  must  be  performed  equally  as 
effectively  to  win  the  encounter.  The 
ability  of  the  operator  to  interface  with, 
gain  access  to,  and  manipulate  information 
contained  in  computer  storage  is  as 
important,  or  even  more  important,  in  the 
weapons  direction/resource  management 
function  as  it  is  in  the  surveillance 
function. 

In  one  of  the  Air  Force’s  new  command- 
control  systems,  a  critical  design  issue 
revolved  around  the  most  efficient  means 
by  which  weapon  directors  could  assign 
interceptors  to  an  incoming  threat  of 
attacking  bombers  via  interactive  computer 
techniques . 

Billy  Crawford  and  William  Pearson, 
from  our  group,  recently  developed  a  series 
of  simulation  efforts  to  evaluate  the  most 
effective  interactive  technique  (computer 
control)  for  directing  weapons  under 
varying  conditions  of  simulated  threats. 

Response  Rate  Study 

Again  the  operator  was  stationed  at 
our  IBM  2250  consoles  which  were  driven 
by  the  IBM  360-40H  system.  The  console 
was  modified  to  include  a  track  ball 
control  which  controlled  a  cursor  on  the 
CRT.  A  diagrammatic  representation  of 
the  console  displays  and  controls  is 
depicted  in  Diagram  B. 

There  were  three  sets  of  experimental 
variables  used  in  this  study:  the  mode 
of  control,  the  target  introduction  rate, 
and  travel  distance  between  a  home  point 
for  a  cursor  and  the  interceptor/target 
locations .  The  mode  of  control  included 
two  versions  of  a  light  pen  (LP-1  and 
LP-2)  where  LP-1  was  the  light  pen  nor¬ 
mally  found  on  the  2250  and  LP-2,  a 
modified  version  which  achieved  an 
optically  sensitive  area  of  approximately 
one  mm.  A  third  control  mode  was  the 
track  ball  which  controlled  cursor  move¬ 
ment  on  the  scope.  Target  presentation 
rate  was  .75  seconds,  1.68  seconds,  2.85 
seconds,  and  4.0  seconds  between  success¬ 
ive  targets  introduced  on  the  scope.  The 
mean  travel  distance  between  cursor  and 
target  varied  from  .5  inches,  2  inches 
and  5  inches. 


TRACK  BALL 
CURSOR  POSITION 

Diagram  B.  CRT  Display  and  Controls  for 
Weapons  Direction  Simulation 


Dependent  measures  used  in  the  study 
included  probability  of  committing  a 
response  (subjects  were  instructed  to 
only  commit  information  assignments  to 
the  computer  if  he  believed  his  cursor  or 
light  pen  was  aligned  on  the  target), 
response  time,  and  the  distance  between 
the  target  on  which  the  subject’s  response 
registered. 


The  results  are  respectively  repre¬ 
sented  for  each  of  the  measures  in  Tables 
V,  VI,  and  VII.  From  Table  V  it  can  be 
seen  that  the  operator  is  willing  to 
commit  a  response  with  roughly  a  30  per¬ 
cent  greater  probability  for  light  pens 
than  for  track  ball.  Table  VI  shows  that 
the  track  ball  takes  about  55-65  percent 
longer  to  commit  responses  and  from  Table 
VII  the  average  final  position  error  is 
more  than  twice  as  large  with  the  track 
ball  than  the  modified  pen  (LP-2). 


These  findings  certainly  verify  one’s 
intuitive  human  engineering  judgment  that  a 
free  or  random  access  interactive  control 
(light  pen)  provides  superior  data  query 
and  assignment  capability  than  does  a  posi¬ 
tion  constrained  query  device  (track  ball). 

To  verify  these  preliminary  findings, 
a  higher  fidelity  weapons  direction  simula¬ 
tion  was  developed. 
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TABLE  V 


Weapons  Direction  Simulation 


AVERAGE  PROBABILITY  OP  RESPONSE  COMMITMENT  FOR  TOE  EXPERIMENTAL  COKDITICNS 


Mode 

Time  Between 

Targets 

Distance 

0.5  2.0 

5.0 

Average 

LP-1 

.75 

.487 

.35 

.162 

♦  79 

1.68 

.912 

.912 

.862 

2.85 

.9500 

.950 

.938 

4.00 

.9750 

.962 

.975 

UP -2 

.75 

.312 

.387 

.150 

.77 

1.68 

.900 

.925 

.875 

2.85 

.950 

.962 

.925 

4.00 

.962 

.950 

.912 

TB 

.75 

.262 

.212 

.212 

.50 

1.68 

.662 

.175 

.075 

2.85 

.975 

.637 

.287 

4.00 

1.000 

.925 

.562 

TABLE  VI 

AVERAGE  RESPONSE  TIME  IN  SECONDS  FOR  THE  EXPERIMENTAL  CONDI  TICKS 


Time  Between 

Distance 

Mode 

Targets 

0.5 

2.0 

5.0 

Average 

LP-1 

.75 

.691 

.684 

.739 

.974 

1.68 

.895 

.981 

1.110 

2.85 

.939 

1.074 

1.234 

4.0 

1.005 

1.092 

1.244 

LP-2 

.75 

.673 

.684 

.717 

1.142 

1.68 

.921 

1.041 

1.197 

2.85 

1.184 

1.243 

1.509 

4.0 

1.313 

1.451 

1.774 

TB 

.75 

.636 

.654 

.640 

1.779 

1.68 

1.291 

1.519 

1.571 

2.85 

1.743 

2.341 

2.676 

4.0 

1.894 

2.863 

3.524 

TABLE  VII  C* 

(U 

AVERAGE  DISTANCE  IN  INCHES  BETWEEN  TARGET  AND  RESPONSE  MODE  INDICATOR  5 


Mode 

Time  Between 
Target® 

0.5 

Distance 

2.0  5.0 

Average 

LP-2 

.75 

.50 

1.28 

4.34 

.71 

1.68 

.12 

.22 

.69 

2.85 

.09 

.14 

.43 

4.0 

.07 

.16 

.50 

TB 

.75 

.58 

2.01 

4.94 

1.84 

1.68 

.31 

1.85 

4.94 

2.85 

.08 

.93 

3-79 

4.0 

.07 

.29 

2.25 

Although  not  a  complete  representation 
of  the  weapons  director^  job  in  current 
command-control  systems  (i.e.,  BUIC  III), 
the  key  tasks  of  interceptor-to-target 
assignments,  heading  change,  altitude  change, 
and  commit -\onc ommit  (drop)  were  simulated. 
Since  one  of  the  major  questions  dealt  with 
track  ball  capability  to  perform  these 
functions  under  peak  threat  load  conditions, 
three  levels  of  task  loading  were  employed: 
(l)  two  waves  of  8  fakers,  each  with  2 
interceptors  capable  of  kill;  (2)  one  wave 
of  16  fakers,  with  A  interceptors  capable 
of  kill;  and  (3)  one  wave  of  20  fakers 
against  4  fighters.  Faker  speed  was  500 
knots  and  interceptor  speed,  620  knots. 
Certainly  one  can  argue  that  the  threat/ 
interceptor  loads  were  not  necessarily  on 
an  increasing  ordinal  scale,  or  even  a 
unidimensional  scale;  however,  they  did 
achieve  differing  and  perhaps  realistic 
operational  workload  conditions . 


Figure  12  summarizes  the  findings  of 
this  study  using  the  average  distance  at 
which  the  threats  were  killed  before  their 
bomb  release  line  (BRL).  Most  operational 
command -control  personnel  agree  to  the 
distance  from  BRL  as  an  acceptable  systems 
effectiveness  measure  assuming  certain 
invasion  geometry/time  relationships. 


Figure  12.  Summary 


SUMMARY 

In  this  paper  we  have  attempted  to 
address  problems  of  evaluating  the 
effectiveness  of  several  existing  and 
advanced  radar  surveillance  and  command- 
control  systems .  This  effectiveness 
depends  largely  on  the  ability  of  the 
operator  to  acquire  a  nonimaging  and 
digitally  displayed  target  and  to 
identify  what  he  perceives  as  being 
different  from  nontarget  returns  using 
the  time  course  history  of  the  returns. 

He  must  then  assign  this  information  to 
a  computer  programmed  to  automatically 
track  the  progress  of  the  target  across 
the  surveilled  area.  The  operator  may 
perform  this  coordinate  position  identi¬ 
fication  and  subsequent  track  initiation 
with  a  light  pen  or  through  cursor  posi¬ 
tioning  with  a  track  ball,  joystick,  or 
other  means .  The  weapons  direction 
function  must  be  performed  as  effectively 
as  the  surveillance  function  in  order  to 
achieve  maximum  defense  capability.  The 
ability  of  the  operator  to  efficiently 
interact  with  the  computer -stored  informa¬ 
tion  about  the  acquired  and  tracked 
targets  will  depend,  to  a  great  extent, 
on  the  computer  interactive  device  used. 

Several  simulation  studies  were 
reviewed  which  were  aimed  at  assessing 
the  operator's  capabilities  as  a  threat 
detector  and  resource  manager  under  vary¬ 
ing  conditions  of  radar  system  performance 
and  threat -environment  dynamics. 

Some  generalizable  principles  have 
evolved  from  these  simulations: 

•  The  human  operator  is  more 
effective  than  originally  thought  in 
acquiring  targets  in  highly  cluttered 
digital  radar  systems  even  under  high 
information  load  conditions.  That  is, 
the  Pd  (probability  of  penetrator 
detection)  decreases  from  a  maximum  of 
1.00  (all  penetrators  detected)  for  a 
target  introduction  rate  of  one/minute 
to  a  minimum  of  .9  for  four  penetrators/ 
minute.  A  decrease  of  only  10  percent; 
the  operator  is  an  excellent  perceptual 
filter. 

Again  from  Figure  12,  it  appears  that 
the  track  ball  is  inferior  to  the  light  pen 
versions  in  the  order  of  5-15  percent  across 
"lead"  conditions — not  as  profound  a 
difference  as  seen  in  the  response  time 


study.  That  is  to  say,  the  penetrators  got 
closer  to  the  bomb  release  line  before 
destroyed  under  all  load  conditions  with 
the  track  ball  than  with  the  light  pens. 

•  The  more  effort  the  surveillance 
operator  devotes  to  cleaning  up  uninitiated 
tracks  on  his  display — keeping  the  computer 
updated  for  all  existing  tracks,  the  better 
the  chance  he  has  of  picking  up  new  incoming 
penetrators — confirms  basic  vigilance  re¬ 
search  findings. 

•  Simply  adding  surveillance  opera¬ 
tors  may  introduce  coordination/communica¬ 
tion  requirements  which  increase  as  tracking 
system  degradation  increases  and  outweighs 
the  seemingly  positive  effects  of  dividing 
the  workload. 

•  Adding  additional  operators  has 
little  effect  on  overall  good  tracking  of 
penetrators  throughout  a  60-minute  invasion 
threat;  however,  to  detect,  initiate  and 
eventually  commit  interceptor  resources, 
more  operators  are  a  distinct  advantage 
since  they  detect  and  initiate  tracks 
sooner. 

•  As  general  human  engineering 
principles  might  indicate,  a  free  or  random 
access  computer  interactive  capability  is 
superior  for  performing  resource  management 
functions  to  position-cursor  dependent 
devices  like  track  balls  and  joysticks. 
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